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PREFACE 


This  report  was  originally  written  as  a  manuscript  for  a  paper  submitted  t o  IEEE  Transac¬ 
tion  on  Automatic  Control  in  June  1983.  In  the  process  of  modifying  the  manuscript,  we  had  to 
condense  its  technical  discussions  considerably  and  omit  many  details.  One  of  our  motivations 
for  writing  the  paper,  however,  was  our  recognition  that  the  multitarget  tracking  problem  is  a 
non-convent  ion  al  estimation  problem  and  needs  a  new  mathematically  sound  foundation.  Thus 
we  intended  in  this  report  to  present  the  complete  mathematical  descriptions  which  we  could  not 
include  in  the  article.  Naturally,  our  main  objective  in  this  report  was  to  make  every  step  in 
theoretical  development  as  clear  and  rigorous  as  possible,  even  at  the  expense  of  some  readabil¬ 
ity.  The  two  major  results  are  stated  in  the  form  of  two  theorems.  Several  details  have  been 
added  in  order  to  ensure  theoretical  completeness.  The  review  of  the  recently  published  NRL 
report  by  Dr.  I.R.  Goodman  has  been  added.  In  effect,  this  report  summarizes  our  efforts  at 
AJ.&D.S.  ,  from  1980  to  1984,  to  create  a  general  theory  of  multitarget  multisensor  tracking. 
Our  original  intention  of  writing  a  two-part  paper  remains;  in  the  near  future  we  will  complete 
Part  n,  which  will  contain  implement ational  issues  such  as  hypothesis  management  with  several 
illustrative  examples. 

The  theoretical  development  for  this  report  was  mainly  supported  by  the  Defense  Advanced 
Research  Project  Agency  under  contract  MDA903-8 1-0333.  The  documentation  preparation  was 
supported  by  the  continued  contract  MDA903-83-C-0333. 

August  1984,  Shozo  Mori 


•  S.  Mori,  C.-Y.  Chong,  E.  Tse  and  R.  P.  Wishner,  “Tracking  and  Classifying  Multiple  Targets  Without 
A  Priori  Identification.”  IEEE  Trans,  on  Auto.  Contr.,  Vol.  AC-31.  No.  5,  May  1986. 

••  In  1985  Advanced  Information  &  Decision  Systems  changed  its  name  to  Advanced  Decision  Systems 
(ADS). 
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Second  Preface 


A  final  draft  of  this  report  was  completed  in  August,  1984,  as  mentioned  above.  Due  to 
very  unusual  circumstance,  it  has  remained  in  that  state  for  almost  three  years.  Since  we  have 
cited  this  report  in  other  papers,  we  are  obliged  to  publish  it.  I  hope  that  this  process  will  be 
finalized  in  leh  immediate  future. 


April  1987,  Shozo  Mori 
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ABSTRACT 


Based  upon  a  general  target-sensor  model  which 
allows  dependence  among  targets  and  state-dependent 
target  detection,  a  Bayesian  solution  to  the  multitar¬ 
get  tracking  problem  is  derived.  When  this  solution  is 
applied  to  a  special  class  of  models,  a  less  general  but 
more  implementationally  feasible  class  of  algorithms 
is  obtained.  Representative  existing  algorithms  are 
then  compared  with  our  results.  By  doing  so^we^pro- 
v ides  a  unified  view  on  Bayesian  approaches  to  the 
multitarget  tracking  problem.  Part  I  covers  most  of 
the  analytical  results,  while  in  Part  II,  hypothesis 
management  and  other  issues  pertaining  to  implemen¬ 
tation  of  multitarget  algorithms  are  discussed  with 
several  examples. 
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NOMENCLATURE 


Explanation 


I  he  set  of  x  satisfying  condition  C . 

a  singleton:  the  set  whose  only  element  is  x . 

the  set  consisting  of  {xlt  •  •  ■  pcn}={xl  |l</<n}  (When 

n  =0,  the  set  is  empty.). 

the  standard  set  theoretic  notations:  member  of,  union,  intersec¬ 
tion,  subset  of  and  proper  subset  of. 

the  set  substruction  operation:  A \B={a£A  \b£B}. 

the  direct  product:  A  x  fl={(a  fr)  |a  &A  and  b(zB}. 

the  cardinality  of  a  set  -  #  (£)  is  the  cardinality  of  set  E ,  i.e.,  the 
number  of  dements  bdonging  to  E . 

a  function  defined  on  set  A  taking  values  in  set  B . 

indexed  tuple:  theoretically  equivalent  to  a  function  a  defined  on 
set  I  or  }. 

factorial:  n  !=l-2---n . 

function  composition  operation:  (/g)(*)=A£ CO)- 

the  indicator  function  of  set  E :  x(f  ;  E)  is  1  if  e£E  0  otherwise. 

the  semi-norm  on  a  Euclidean  space  defined  by  a  nonnegative 
definite  symmetric  matrix  A:  where  xT  is  the  tran¬ 

spose  of  vector  x. 


1 

) 

0 

D 


Ak 


MIJ) 


the  assignment  function  at  data  set  k:  a  random  function  defined 
on  a  subset  of  the  target  index  set  IT  £  taking  values  in 

JM(k)£  {1  where  NT  is  the  total  number  of  targets 

and  NM(k)  is  the  number  of  measurements  in  data  set  k. 

the  set  of  all  the  one-to-one  functions  defined  on  set  /  taking 
values  in  J . 
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$XT  (*  ) 
0FA&) 


Ck  O’  14) 


Dom(f) 
£(•), £(•!•) 
Fl(l-) 


/*(4) 


a  0-1  matrix  representing  the  constraint  in  the  0-1  integer  pro¬ 
gramming  form  of  hypothesis  evaluation. 

the  normalizing  constant  for  the  batch-processing  type  hypothesis 
evaluation  for  the  hypotheses  on  the  cumulative  data  set  Z<A  ^  in 
i.i.d .-Poisson  cases. 

the  normalizing  constant  for  the  modified  batch-processing  type 
hypothesis  evaluation  for  the  hypotheses  on  the  cumulative  data 
set  Z^KS>  in  i.i.d.-Poisson  cases. 

the  density  of  new  targets  (Reid’s  terminology  in  [4]). 

the  false  alarm  density:  (k)=upA  (k)/ fiSk(Y,k)  where  i/fA(k)  is 

the  expected  number  of  false  alarms  in  data  set  k  and  n  (Yti)  is 
the  scan  (search)  volume  of  sensor  s*. 

the  normalizing  constant  for  the  recursive  hypothesis  evaluation 
equation  at  data  set  k  in  general  cases. 

the  normalizing  constant  for  updating  the  number-of-target  distri¬ 
bution  at  data  set  k  in  general  cases. 

the  normalizing  constant  for  updating  the  target  state  distribution 
at  data  set  k  in  general  cases. 

the  normalizing  constant  for  updating  the  track  state  distribution 
conditioned  by  a  cumulative  data  set  Z  restricted  by  a  track  r 
with  a  measurement  value  y  in  ii.d.-Poisson  cases. 

the  objective  function  coefficient  vector  in  the  0-1  integer  program¬ 
ming  form  hypothesis  evaluation. 

the  domain  of  function  /. 

the  mathematical  expectation;  unconditional  and  conditional. 

the  transition  probability  of  the  target  states  in  general  cases  when 
the  number  N j  of  targets  is  n  . 

the  state  transition  probability  for  each  individual  target  in  i.i.d.- 
Poisson  cases. 
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the  density  of  extended  st  at  e-to-measu  remen  t  transition  probabili¬ 
ty  at  data  set  k:  g(y  |.t,A)  is  pM(y  | xji)pp{x  |/c)  if  y  (real  data 
--  target  detected)  1  —pp(x  (A)  otherwise  (no  data  —  target  un¬ 
detected). 


the  set  all  the  (data-to-data)  hypotheses  on  cumulative  data  set 

z(*>. 


the  state-to-measurement  matrix  in  linear-gaussian  measurement 
models. 


the  objective  function  in  the  0-1  linear  programming  form  of  hy¬ 
pothesis  evaluation. 


the  set  of  all  the  indices  for  the  targets  detected  in  data  set  k: 
/ Qf  ( k  )=Dom  (Ak  yZIj . 


the  target  index  set:  }  where  N?  is  the  total  number 

of  targets. 


the  cumulative  measurement  index  set  up  to  data  set  k  (when  the 
superscripts  are  dropped,  J  is  the  measurement  index  set  for  the 
currently  available  cumulative  data  set  and  J  for  the  same  minus 
the  most  recent  data  set). 


the  set  of  indices  for  all  the  false  alarms  in  data  set  k: 
JFA{k)=JM{k)\Im{Ak) 


the  measurement  index  set  at  data  set  k :  J M(k)={l,...J'l w(&)} 
where  NM(k)  is  the  number  of  the  measurements  in  data  set  k. 


the  set  of  indices  for  the  false  alarms  in  data  set  k  hypothesized  by 
a  hypothesis  X  given  NM(k}=m: 

l*Mi  €<l,.../n)|(j^)€U(r  |A:)>. 

rex 


indices  for  data  sets. 


the  likelihood  of  ( z(k ),X),  i.e.,  the  data  set  z(k)  and  the  data-to- 
data  hypothesis  X,  given  the  cumulative  data  sets  Z^-1^  and  the 
predecessor  hypothesis  X,  in  general  cases. 


the  likelihood  of  measurement  y  given  a  cumulative  data  set  res¬ 
tricted  by  a  track,  2 ^  ,  in  i.i.d .-Poisson  cases. 
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/a(^(A)) 


the  track  likelihood  of  track  r  given  'he  cumulative  data  set  Z(A> 1 

O 

at  data  set  K  in  i.i.d. -Poisson  cases. 


n-z(K)) 

\k 


X  ,  X 


the  modified  track  likelihood  of  track  r  given  the  cumulative  data 
set  Z{K)  in  i.i.d. -Poisson  cases  with  the  Poisson  number-of-false- 
alarm  assumption. 

the  random  collection  of  sets  of  measurement  indices  in  the  cumu¬ 
lative  measurement  index  set  J^k\  representing  the  measurements 
from  the  (real)  targets.  (Each  possible  realization  of  A*  is  called  a 
data-to-dala  hypothesis  on  Z^k\  The  subscript  k  is  dropped  when 
there  is  no  fear  of  confusion. 

dala-to-data  hypotheses:  X  is  the  unique  immediate  predecessor  of 
X. 


X  |  Z 


the  data-to-data  hypothesis  X  restricted  by  the  cumulative  data 
set  Z. 


^nrw 


V, 


the  set  of  newly  initiated  tracks  (hypothesizing  newly  detected  tar¬ 
gets)  in  data-to-data  hypothesis  X. 

the  set  of  previously  initiated  tracks  (hypothesizing  previously 
detected  targets)  in  data-to-data  hypothesis  X. 

the  hybrid  measure  on  the  measurement  value  space  Y ,  for  sensor 
s. 


NDT{k) 

NFA(k) 

NM(k) 


the  number  of  targets  detected  at  data  set  k: 
ndt  (k  )=#  (hr  (k  ))=#  (Dom  {Ak )). 

the  number  of  false  alarms  in  data  set  k. 

the  number  of  measurements  in  data  set  k. 


Nsrik) 


the  number  of  targets  which  are  detected  at  data  set  k  for  the  first 
time. 


Nt 

Nt 

"0 

(k  ) 


the  total  number  of  targets 

the  a  priori  upper  bound  on  the  total  number  of  targets  (possibly 
+oc  such  as  in  i.i.d.-Poisson  cases). 

the  a  priori  expected  number  of  targets  in  ii.d  .-Poisson  cases, 
the  expected  number  of  false  alarms  at  data  set  k. 
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Vk 

the  a  posteriori  expected  number  of  undetected  targets  at  data  set 
k. 

n* 

the  function  defined  on  taking  values  in  the  target  index  set 

Ij  si  Clk  uniquely  assigns  a  target  index  to  each  r  in 

\k.  (We  call  every  possible  realization  of  Clk  a  target-to-track  hy¬ 
pothesis.  The  subscript  is  dropped  when  there  is  no  fear  of  confu¬ 
sion.) 

,  D 

target-to-track  hypotheses:  hypotheses  on  the  realization  of  nk:  D 
is  a  restriction  of  ui. 

Pd 

the  detection  probability  function  in  general  cases. 

Pkk) 

the  a  posteriori  probability  of  each  data-to-data  hypothesis  at  data 
set  k  (hypothesis  evaluation  function). 

PM 

the  measurement  value  probability  density  function  in  general 
cases. 

PsfA 

the  number-of-false  alarm  function  in  general  cases. 

p&¥ 

the  a  posteriori  distribution  of  the  number  NT  of  targets  given  a 
data-to-data  hypothesis  at  data  set  k  in  general  cases. 

pkk) 

the  a  posteriori  target  state  distribution  given  the  number  of  tar¬ 
gets  and  a  data-to-data  hypothesis  at  data  set  k  in  general  cases. 

Prob. 

the  probability  measure  on  the  underlying  probability  space. 

PD 

the  detection  probability  function  in  i.i.d.-Poisson  cases. 

n  miX 

Pd 

the  maximum  value  of  the  detection  probability  function  in  i.i.d.- 
Poisson  cases. 

Pfa 

the  density  of  the  distribution  of  the  false-alarm  values  in  i.i.d.- 
Poisson  cases. 

Pm 

the  density  of  the  target-state-to-measurement  transition  probabil¬ 
ity  in  i  j .d . -Poisson  cases. 

P»fa 

the  numbei-of-false-alarm  probability  distribution  in  i.i.d.-Poisson 

cases. 

p,(  14) 

the  track  state  distribution  conditioned  by  the  cumulative  data  set 

restricted  by  a  track,  i.e.,  Zy. 
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N  omcndat  ure 


n 


IT 


Q(K) 

Q’o 

<?0 


Rk 

IR 

S 

T 

Sk 

|*-1 

T(Z(/t)) 


T 


W(X/») 


X 

Xn 


then-target  permutation  homeomorphism. 

a  permutation  on  {], . /i  }. 

a  labeled  partition  (Goodman’s  notation  in  [5]). 

the  a  priori  target  state  distribution  when  the  number  A'r  of  tar¬ 
gets  is  n  in  general  cases. 

the  a  priori  target  state  distribution  common  to  all  the  targets  in 
i.i.d.-Poisson  cases. 

the  error  variance  matrix  in  i.i.d.-Poisson  linear-gaussian  cases, 
the  set  of  'eal  numbers, 
the  set  of  sensors. 

a  super-sensor  (used  to  explain  the  PDA  and  JPDA  algorithms). 

the  sensor  which  generates  the  k- th  data  set. 

the  target  state  prediction  variance  in  i.i.d.-Poisson-gaussian  cases. 

the  set  of  all  the  tracks  on  the  cumulative  data  set  Z^k\ 

the  time  at  which  the  £-th  data  set  is  generated. 

the  symbol  for  "nothing,”  "no  target  present,"  or  "no  measurement 
generated.":  according  to  the  usual  convention,  the  set  of  all  the 
functions  defined  on  the  empty  set  taking  values  in  a  nonempty 
set  consists  of  a  single  element  whose  domain  and  range  are  both 
empty.  6  is  a  generic  symbol  for  such  a  special  function. 

a  track:  a  subset  of  a  cumulative  measurement  index  set  such  that 
it  contains  at  most  one  measurement  index  set  for  each  data  set. 

the  set  of  all  the  target-to-track  hypotheses  consistent  with  a 
data-to-data  hypothesis  X  and  the  condition  that  NT  of  targets  is 
n  . 

the  common  space  for  the  individual  components  of  target  states 
in  i.i.d.-Poisson  cases. 

the  target  stale  component  when  the  number  NT  of  targets  is  n  in 
general  cases. 
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X 


c 

/I 


x- 

X(t) 

■\(t) 

xk  |*-i 

x 


Y, 


y,ik) 


z  ,z , z{k) 


4 

Z(k) 


2Z+ 


the  space  for  the  state  component  common  to  all  the  targets  when 
the  number  Nj  of  targets  is  n  in  general  cases. 

the  space  for  the  collection  of  individual  components  of  target 
states  when  the  number  NT  of  targets  is  n  in  general  cases. 

the  individual  target  state  space:  Xj,=X),  x X‘n{n  times),  in  gen¬ 
eral  cases. 

the  target  states  at  time  t  in  general  cases. 

the  state  of  the  i  -th  target  in  i.i.d.-Poisson  cases. 

the  predicted  target  state  in  the  i.i.d.-Poisson-gaussian  cases. 

the  0-1  vector  representation  of  a  data-to-data  hypothesis  in  the 
0-1  integer  programming  form  of  batch-processing  hypothesis 
evaluation. 

the  measurement  value  space  for  sensor  s . 

the  measurement  value  assigned  to  track  r  at  data  set  k:y[r  |lt]  is 
yt  ( k )  (the  value  of  the  j  -th  measurement)  if  (j  J<.)£t  and  is  6  oth¬ 
erwise  (there  is  no  j  such  that  {j  Jc)£t). 

the  measurement  value  of  the  y -th  measurement  in  the  k- th  data 
set. 

cumulative  data  sets:  Z ^  is  the  cumulative  data  set  available  at 
k.  When  the  super  scripts  are  omitted,  Z  represents  a  currently 
available  cumulative  data  set  and  Z  is  the  same  minus  the  most 
current  data  set. 

the  cumulative  data  set  restricted  by  track  r. 
Zt={(y[T\k]Jc)\(z(k)*)eZ}. 

the  k- th  data  set:  z(k)=((yj  (k (k )jk  ),  where  NSi(k)  is 
the  number  of  measurements  and  yj(k)  is  the  value  of  the  j- th 
measurement  in  the  data  set.  The  /c-th  data  set  is  generated  by 
sensor  sk  at  time  tk.  The  &-th  data  set  is  occasionally  simply 
called  data  set  k . 

the  set  of  all  the  nonnegative  integers. 
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Section  I 


1.  INTRODUCTION 

Multitarget  tracking  is  a  problem  for  tracking  and  classifying  a  (generally  unknown) 
number  of  homogeneous  and/or  heterogeneous  objects  (called  targets  as  a  generic  name),  based 
on  origin-ambiguous  noisy  measurements  from  homogeneous  and/or  heterogeneous  sensors, 
where  some  targets  may  not  be  detected  and  some  measurements  may  not  originate  from  any 
object  of  interest  (false  alarms,  clutters,  etc.).  When  the  origin  of  each  measurement  is  known, 
tracking  and  classifying  are  reduced  to  joint  estimation  of  continuous  and  discrete  (respectively) 
parts  of  states  of  stochastic  processes  based  on  discrete-time  observation.  Multitarget  tracking 
problems  are  technically  very  interesting  because  they  may  require  formalism  which  cannot  be 
found  in  traditional  estimation  problems.  They  have  great  importance  due  to  their  wide  range 
of  applications  such  as  air  defense,  air  traffic  control,  ocean/battleCeld  surveillance,  etc.  In  the 
past  two  decades,  this  new  field  has  attracted  numerous  researchers.  Past  achievements  are  well 
documented  in  the  survey  paper  [1]  and  the  Naval  Ocean  Surveillance  Correlation  Handbooks, 
[2]  and  [3].  The  introductory  section  of  [4]  contains  a  short  but  excellent  survey  and  a  recently 
published  report  [5]  discusses  representative  past  results  extensively.  Despite  considerable  efforts 
spent  in  this  area,  the  present  stage  may  well  be  characterized  as  an  unorganized  collections  of 
numerous  "named"  or  "unnamed"  algorithms.  As  correctly  pointed  out  in  [5],  no  general  multi- 
target  tracking  concept  has  been  created.  This  failure  may  be  attributed  to  the  general  trend 
that  many  algorithms  are  developed  in  very  problem-specific  ways  and  based  on  special  assump¬ 
tions.  One  of  the  objectives  of  this  report  is  to  create  a  set  of  basic  concepts  upon  which 
researchers  can  communicate  with  each  other. 

In  a  recent  paper  [6],  an  attempt  was  made  to  create  a  unified  view  on  this  subject  by 
describing  the  multitarget  tracking  problems  in  terms  of  a  special  class  of  dynamical  systems, 
i.e.,  event-driven  linear  stochastic  systems.  In  multitarget  tracking  problems,  however,  when 
target  detection  and  measurement  assignment  to  targets  are  considered  as  an  "event”  process, 
such  a  process  is  clearly  driven  by  the  target  state  process,  not  vice  versa.  In  this  report,  we  will 
present  a  completely  different  view  on  the  subject.  We  contend  that  the  multitarget  tracking 
problem  is  "unconventional"  or  "non-classical"  in  the  sense  that  it  calls  for  a  new  type  of 
mathematical  modeling  and  formalism.  The  essential  features  of  the  problem  are:  (1)  The 
number  of  the  objects  to  be  estimated  is,  in  general,  random  and  unknown.  The  number  of 
measurements  in  each  sensor  output  is  random  and  a  part  of  observation  information.  (2)  Gen¬ 
erally,  there  is  no  a  priori  labeling1  of  targets  and  the  order  of  measurements  in  any  sensor  out¬ 
put  does  not  contain  any  useful  information.  For  example,  a  measurement  couple  (>'uv:)  fr°m  a 
sensor  is  totally  equivalent  to  (y2)yj).  When  a  target  is  detected  for  the  first  time  and  we  know 
it  is  one  of  n  targets  which  have  never  been  seen  before,  the  probability  that  the  measurement 
originating  from  a  particular  target  is  the  same  for  any  such  target,  i.e.,  it  is  1/n  .  The  above 
properties  (1)  and  (2)  are  properly  reflected  when  both  targets  and  sensor  measurements  are  con¬ 
sidered  as  random  sets  as  defined  in  (7],  We  will  tentatively  call  such  properties  random- set 
nature.  In  short,  according  to  our  view,  one  of  the  fundamental  aspects  of  multitarget  tracking 

1  By  this,  we  mean  that  the  targets  does  not  have  a  priori  identification. 

See  the  last  remark  in  Section  2. 
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problems  is  random-set  nature.  The  uncertainty  of  the  origin  of  each  measurement  in  every  sen¬ 
sor  output  should  then  be  imbedded  in  a  sensor  model  as  a  stochastic  mechanism  which  converts 
a  random  set  (set  of  targets)  into  another  random  set  (sensor  outputs). 

,  Theories  of  random  sets  or  stochastic  geometry  are  mainly  concerned  with  random  sets 

whose  realizations2  are  uncountable  sets  such  as  closed,  open  or  convex  sets  in  Euclidean  spaces, 
and  are  mathematically  highly  sophisticated.  Fortunately,  when  we  restrict  ourselves  to  random 
sets  whose  realizations  have  only  finite  members  with  probability  one,  we  can  still  apply  stan¬ 
dard  probabilistic  techniques.  For  example,  a  random  finite  set  X  of  reals  can  be  probabilisti¬ 
cally  completely  described  by  specifying  probability  Prob.  {#  {X )=n  }3  for  each  nonnegative  n 
•  and  joint  probability  distribution  with  density  pn(xir....pc„)  of  elements  of  the  set  for  each  posi¬ 

tive  n .  In  order  for  this  specification  to  be  appropriate,  however,  we  must  require  every  p„  to 
be  interchangeable  (permutable).  This  is  the  basic  approach  which  we  take  in  this  report.  As  in 
almost  all  the  existing  literature  on  multitarget  tracking,  the  basic  task  is  to  hypothesize  the  ori¬ 
gin  of  each  measurement  and  to  evaluate  all  the  possible  hypotheses,  or  in  other  words,' to  gen¬ 
erate  and  evaluate  all  the  possible  partitions  of  measurements.  To  accomplish  this,  we  will 
define  hypotheses  and  tracks  (two  terms  widely  used  but  often  vaguely  defined)  rigorously  and 
;  relate  them  to  particular  random  sets. 

In  the  next  section,  we  will  describe  a  general  class  of  target/sensor  models  which  form  the 
basis  for  the  new  general  multitracking  algorithm  described  in  Section  4.  Section  3  gives  defini¬ 
tions  of  hypotheses  and  tracks.  For  a  problem  statement,  Sections  2  and  3  are  quite  lengthy,  but 
necessary  so,  since  to  date  there  has  been  no  mathematically  rigorous  problem  statement  for  gen¬ 
eral  multitarget  tracking  problems,  and  some  confusion  seems  to  exist  in  basic  modeling.  In  Sec¬ 
tion  4,  a  general  recursive  formula  for  evaluating  hypotheses  is  derived  based  on  a  general  model. 
In  Section  5,  we  restrict  ourselves  to  a  more  limited  class  of  models,  i.e.,  what  we  may  call  i  j.d. 
(independent,  identically  distributed)  -  Poisson  (number  of  targets)  cases.  The  results  derived  in 
Section  5  are,  however,  general  enough  to  include  many  existing  algorithms  roughly  speaking  as 
a  subset,  as  seen  in  Section  6  where  we  compare  our  results  with  other  existing  algorithms.  Sec¬ 
tions  5  and  6  together  provide  a  set  of  implementationally  feasible  algorithms  and  a  unified  view 
on  existing  algorithms.  Part  I  covers  most  of  the  theoretical  issues,  whereas  Part  II  of  this 
report  describes  hypothesis  management  and  other  implement  at  ional  issues  with  simple  exam¬ 
ples. 


A 


D 

3 


2  In  this  report,  a  reali&tion  of  a  random  element  is  synonymous  with  a  sample,  i.e.,  a  particular  value  of 
a  random  element  when  a  point  in  the  underlying  probability  space  is  specified. 

3  For  any  set  A  ,  #  (A  )  is  the  cardinality  of  A  or  the  number  of  members  of  A  . 
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2.  MODELS  OF  TARGETS  AND  SENSORS 


I  i 

i 


j 


0 


2.1.  Target  Model  A  target  is  a  generic  name  for  the  least  unit  of  object  which  we  wish  to 
track  and/or  classify  and  which,  when  detected  by  a  sensor  at  a  certain  time,  may  generate  a 
measurement  in  the  sensor’s  output.  In  our  model,  all  the  targets  of  interest  are  modeled  as  one 
entity  rather  than  as  a  collection  of  individual  targets.  Formally,  a  target  system  state  at  time  t 
is  a  realization  (A” (/  )//r  (r ))  at  t  of  a  continuous-time  stochastic  process4  ( X  (t  )JJT  (/  )),€[,  *)  on 

a  target  system  state  space  U  (x„x{> j  })  which  is  the  formal  disjoint  union  of  a  system 

n  —0  '  ' 

(Xn)*.,  of  hybrid  sets.  By  a  hybrid  set,  in  this  report,  we  mean  the  direct  product  of  a  measur¬ 
able  set  (called  continuous  part)  in  a  Euclidean  space  and  a  finite  set  (called  discrete  part).  The 
use  of  hybrid  sets  allows  us  to  include  discrete  component  into  the  target  states,  e.g.,  different 
types  of  targets  (with  different  dynamics),  sudden  structural  changes  in  dynamics  (such  as 
maneuvers),  changes  in  operational  modes  (such  as  stop/ go  modes),  etc.,  in  addition  to  the  usual 
physical  (geolocational)  states  such  as  position,  altitude,  velocity  and  acceleration.  The  second 
element,  NT(t),  represents  the  total  number  of  targets  in  the  system  at  time  / .  When  n  =0,  X„ 
is  defined  as  { 6 }  where  8  is  merely  a  symbol  for  "no  target”  and  8£X„  for  all  n  >0.  The  follow¬ 
ing  assumption  is  made  for  the  stochastic  process  (. X (/  )Nt  (f  ))/e|/0po): 


Assumption  1:  (Constant  Number  of  Targets  and  Markovian  Property)  The  second  com¬ 
ponent  of  the  stochastic  process  ( X  fir)’  i-e.,  the  number  of  targets  N  T ,  is  constant  but  in  gen¬ 
eral  random  with  a  known  probability  distribution.  For  each  positive  integer  n,  given  NT=n , 
the  first  component  X  of  (X /Jj)  *s  a  temporally  homogeneous  Markov  process  on  X„  having 
initial  distribution 


QZ(dX)  =  Prob.  {X (l oKdX  j N T  —n  > 


(1) 


and  transition  probability 

F^(dX  \X)  =  Prob.  {X (t  + A t  )CdX  \ X(t)=XJJT~n}  (2) 

for  each  X£X„  ,  each  t  €[f  0>co)  and  each  A/  >0. 


Remark  1:  The  temporally  homogeneity  assumption  can  easily  be  removed.  However,  this 
assumption  reduces  notational  complexity  in  the  following  discussions.  One  should  not  confuse 
the  constant  number,  Nr,  of  targets  with  the  number  of  detected  targets  or  that  of  targets  in 

4  /0  is  the  time  when  no  sensor  has  begun  operating  yet . 
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the  sensors’  fields  of  view.  For  example,  when  the  targets  are  born  and  die  as  in  the  models 
described  in  [5]  and  [8],  we  can  imbed  an  appropriate  birth-death  process  into  the  target  dynam¬ 
ics.  hi  such  a  case,  N-j-  is  the  total  number  of  all  the  targets  which  once  exists  in  a  nonempty 
interval  included  in  [/ 0»co)-  Implicitly,  we  have  assumed  the  finiteness,  i.e.,  Prob.  (NT  <  oo)=l, 
which  correctly  reflects  the  reality5,  while  an  upper  bound  Rj  such  that  Prob.  {N^  <NT}=\ 
may  or  may  not  exist. 


In  order  to  model  the  random-set  nature  of  the  targets  as  a  whole,  we  need  some  additional 
structure  in  the  model:  First,  for  each  positive  integer  n,  we  assume  that  the  component  X„  of 
the  target  system  space  is  further  decomposed  as 

X„=XncxX'  (3) 

where  X?  is  the  space  for  the  common  target  state  for  all  the  targets  in  the  system  and 
X/=X{x  <  •  •  xX‘(n  times)  is  the  direct  product  of  n  identical  individual  target  state  spaces 
Xj,.  In  a  simple  example,  X,f={0}  (no  common  target  state  space)  and  every  XI  is  a  hybrid 

space  so  that,  for  each  n  and  t ,  when  Nr  =n ,  X (t  )=(x x(t  A„(0)  and  each  x,{t)  represents 

the  i-th  target’s  state  in  X^  at  time  t .  In  another  example,  X^=X'„=JR2  (-  the  set  of  pairs  of 

reals)6,  and  (x0,xlr . ^)  in  X„  represents  a  target  system  state  for  a  group  of  targets,  where 

Xq  is  the  hypothetical  centroid  and  each  x,  is  the  displacement  of  target  i  from  x0.  We  call  a 
function  ILX,,— *X„  an  n-target  permutation  homeomorphism  ( induced  by  a  permutation  n)  if,  for 
every  (Xc  ,(X,  )?_x)  in  X„  =X^x  X we  have 


n(*c  ,(X, ),"_,)  =  (Xc  ,(*„(, )),"_,)  (4) 

for  a  permutation  jt  on  {l,...,n}.  Then  we  can  state  our  interchangeability  requirement  for  tar¬ 
gets  in  the  following  way: 


Assumption  2:  (Target  Interchangeability  (1))  For  each  positive  integer  n,  initial  distribu¬ 
tion  Q o  and  state  transition  probability  are  interchangeable  (or  permutable )  with  respect  to 
the  individual  target  state  part  Xln  of  X„  ,  i.e., 

Qo  (Tl(dX))=QZ(dX)  (5) 


5  Consider  all  the  airplanes  in  the  world.  The  number  of  the  airplanes  which  have  been  built  by  the 
present  time  is  of  course  finite.  The  number  of  airplanes  which  will  be  built  in  the  future  is  also  finite 
although  a  probability-one  upper  bound  on  it  may  not  ecist. 

8  JR=(-oo,oo) 
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FnAl(n{dX)\U(X))  =  FlJ(dX  \X)  , 


for  any  n  -target  permutation  homeomorphism  n. 


This  assumption  states  that  there  is  no  a  priori  labeling7  of  targets  and  the  labeling  by 
means  of  integers,  {l,..,n  },  is  just  one  of  many  equivalent  ways  of  labeling.  Since  targets  do  not 
have  a  priori  labels,  sensors  can  not  treat  targets  with  a  priori  discrimination.  This  fact  must  be 
incorporated  into  any  sensor  model  as  will  be  done  below.  Thus  our  target  model  is  represented 
by  ( Prob .  {NT  =n  },Qq  ,(F^  >0)^.0  which  satisfies  Assumptions  1  and  2. 


2.2.  Sensor  Model  Let  5  be  a  finite  set  of  sensors.  Each  sensor  s  in  5  is  modeled  as  a  gen¬ 
eric  mechanism  which  observes  the  target  system  state  space  and  generates  a  finite  set  of  meas¬ 
urements,  called  data  sets8,  intermittently  according  to  a  certain  sampling  pattern.  Each  meas¬ 
urement  in  a  data  set  output  from  sensor  s  in  S  is  an  element  of  measurement  value  space  Ys 
for  sensor  s .  Each  Y,  is  also  a  hybrid  space  with  hybrid  measure  p, .  By  the  hybrid  measure 
on  a  hybrid  space,  we  mean  the  direct  product  measure  of  Lebesgue  measure  on  the  continuous 
part  and  the  counting  measure  on  the  discrete  part.  The  continuous  part  of  Y,  may  be  used  for 
analog  information  such  as  range,  azimuth,  elevation,  etc.,  whereas  the  discrete  part  may  be 
used  for  discrete  information  such  as  size/ cross-sect  ion  classification  of  aircraft  radar  images, 
wheeled/tracked  classification  of  ground  vehicles,  etc. 


Formally  a  data  set  is  a  random  dement  ((>>,  ),  4  j)  which  is  an  element  of 


U  U  (  (Y,  T  x  {m  }*  [/  o>°°)x  >) 

m  -0  teS 


and  represents  NM  measurements,  y j,.... ,  generated  by  sensor  s  at  time  1  . 
(Y,  r  =Y,  *  — x  y,  (m  times )  and  (Y, )°={0>  where  $  is  a  symbol  for  "no  measurement."  As  we 
did  for  targets,  we  have  included  the  number  NM  of  measurements  in  each  data  set  to  model 
objects  with  variable  number  of  members  explicitly.  The  purpose  of  the  inclusion  of  (r  s)  is  to 
make  clear  the  source  of  each  information.  Then  we  call  any  collection  of  such  data  sets  a 
cumulative  data  set.  The  following  six  assumptions  are  used  to  further  specify  our  generic  sensor 
model: 


7  See  the  last  remark  in  this  section. 

8  synonymous  to  scans,  frames,  return  sets,  etc. 
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Assumption  3:  (Finite  Sampling  Rate  anil  Known  Exact  Timing)  Every  sensor  generates  at 
most  one  data  set  at  any  given  time  t  £[r0,oc)  and  it  produces  only  a  finite  number  of  data  sets 
in  any  given  finite  interval  in  (/0,oo).  The  time  (/€[/0,oo))  and  the  source  (sGS)  tire  exactly 
known.  The  sensor  scheduling  is  completely  determined  by  each  sensor  and  is  assumed  to  be 
independent  of  the  target  system. 


Assumption  4:  (Conditional  Independence J  Given  the  target  system  stochastic  process, 
each  data  set  is  independent  from  other  data  sets  and  depends  only  on  the  target  system  state 
at  the  time  it  is  generated. 


Assumption  5:  (No  Merged  Measurement )  Each  measurement  in  each  data  set  may  or  may 
not  originate  from  a  target.  When  a  measurement  in  a  data  set  originates  from  a  target,  it  does 
not  originate  from  two  or  more  targets. 


Assumption  6:  (No  Split  Measurement)  No  target  generates  more  than  one  measurement  in 
any  data  set. 


Assumption  7:  (Random  Order)  The  order  of  the  measurements  in  any  data  set  contains 
no  information  about  targets. 


Assumption  8:  (Absolutely  Continuous  Distribution)  For  each  data  set  from  sensor  s  in  S, 
given  the  target  system  state,  the  set  of  detected  targets,  the  number  of  measurements  and  the 
assignment  of  the  detected  targets  to  the  measurements,  the  joint  distribution  of  measurement 
values  has  density  with  respect  to  hybrid  measure  p,. 


Assumption  1  implies  there  are  at  most  countable  data  sets  generated  in  (/0,oo).  Thus, 
throughout  the  rest  of  this  paper,  we  assume  that  the  data  sets  are  indexed  by  positive  integers 
as9,  z  (1)^(2)^  (3), . ,  where 


z(*)  =  ((y>  ik))^k)^M(kyksk) 


(?) 


is  the  A:-th  data  set,  in  such  a  way  that  tk<tk Whenever  k<k.'  We  will  call  each  z(k)  simply  data 
set  k.  Since  (tkJk)  is  assumed  to  be  exactly  known  (Assumption  3),  we  may  treat  them  as  non- 

9  Alternatively,  we  can  index  data  sets  by  their  third  and  fourth  components,  i.e.,  time  and  source  (t .s'), 
which  by  Assumption  3  can  identify  data  sets  uniquely.  This  alternative  indocing  is  useful  when  we  consid¬ 
er  target  svstem  state  estimation  based  on  different  cumulative  data  sets. 
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random  elements.  Assumption  4  then  should  be  interpreted  as  follows:  For  any  cumulative  data 
set  {;(!), . z{K  )},  we  have 


Prob •  )€(£*»<{(/* A)})}  | (A ’(/*))£, ,Nr)  = 

ft  ProZ).  ({z  (k  )e(Ek  x  {(/*  s* )})}  |  X  (/* )  Ar  ) 


where  each  £*  is  a  measurable  set  in  U  (FJ.)m  x{m  },  i.e.,  £*=U£rx{m}  with  each  £™ 

m  -0  *  m  -0 

being  a  measurable  set  in  (y,t)m.  Therefore,  modeling  sensors  means  specifying  each  factor  on 
the  right  hand  side  of  (8)  for  each  k . 

One  of  unique  notions  involved  in  multitarget  tracking  problems  is  "detection  of  targets”  or 
"origins  of  measurements,"  which  can  be  modeled  under  Assumptions  5  and  6  as  follows:  First  we 
define  a  random  set 

IT  ={/€Z+|1<«<Nt}  (9)10 

which  we  call  target  index  set.  For  each  k,  random  set 

(10) 

is  called  measurement  index  set  at  k.  Then,  for  each  k,  we  assume  a  random  function11  Ak 
defined  on  a  subset  of  IT  and  taking  values  in  7W(&).  We  call  such  Ak  assignment  function  at 
k.  ju=Ak(i )  means  that  the  i-th  target  is  detected  by  sensor  sk  at  time  tk  and  generates  the  j- 
th  measurement,  or  ;-th  measurement  originates  from  i-th  target.  Thus 

IDT(k)  =  Dom(Ak)  (11),: 

is  the  random  set  consisting  of  indices  for  targets  detected  by  sensor  sk  at  time  tk  (or  detected  in 
data  set  k)  and 


10  7L+  is  the  set  erf  all  the  nonnegative  integers. 

11  Usually  a  random  function  is  defined  as  an  random  elemtnt  whose  samples  share  the  same  domain  But 
the  domain  of  the  random  function  Ak  may  differ  from  sample  to  sample. 

u  For  a  function  /  ,  Dom  (J  )  and  Im  ( f  )  are  the  domain  and  the  range  of  /  . 
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JfaW  =JM(k)\Im(Ak)  (12)13 

is  the  set  of  indices  of  measurements  in  data  set  k  which  do  not  originate  from  any  target.  We 
call  such  measurements  false  alarms u.  Assumption  6  then  implies  that  any  realization  of  assign¬ 
ment  function  Ak  is  one-to-one.  For  the  later  use,  we  define  random  integers,  i.e,  the  number  of 
detected  targets  at  k. 


NDT(k)  =  #(IDT  (*)) 


and  the  number  of  false  alarms  in  data  set  k 


NFA{k)  =  #{JFA{k))  ■ 


Then  we  have  an  obvious  relationship 


KM(k)  =  NDT(k)  +  NFA(k) 


for  each  k . 


We  may  visualize  our  generic  sensor  measurement  generating  mechanism  as  the  following 
four-step  processes,  for  each  k: 


(1)  Detection:  For  each  nonnegative  integer  n ,  let  D(n )  be  the  collection  of  all  the  subset 

of  {1, . /i }.  D(n)={0}  when  n  — 0.  Apparently,  NT=n  implies  /z>r(*)€D(n )  for  each  k. 

Then  the  detection  mechanism  can  be  specified  by  the  detection  probability  function  defined  by 

Pd(D  \X/i*)  =  Prob.{IDT{k)=D  | X(tk)=XJJT=n}  (16) 


for  each  (D  fX  /i  )€UD(n)x  Xnx  {n}  and  each  k.  When  D£D(n),  the  right  hand  side  of  (16)  is 

n-0 


13  \  is  the  set  subtraction  operation,  i.e.,  fcr  two  sets,  A  and  B,A\B=*{a£A  \ a£B}. 

14  They  have  also  many  other  names  such  as  false  returns,  clutters  (in  case  of  radars),  nuisance  targets 
(not  real  targets),  etc. 
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JFA(k)=JM{k)\Im{AK)  (12)13 

is  the  set  of  indices  of  measurements  in  data  set  k  which  do  not  originate  from  any  target.  We 
call  such  measurements  false  alarms 14.  Assumption  6  then  implies  that  any  realization  of  assign¬ 
ment  function  Ak  is  one-to-one.  For  the  later  use,  we  deQne  random  integers,  i.e,  the  number  of 
detected  targets  at  k 

^Dr(k)  —  #  (^Dr(k))  (13) 

and  the  number  of  false  alarms  in  data  set  k 


NFA(k)  =  #(JFAk))  ■ 


Then  we  have  an  obvious  relationship 


NM{k)  =  NDT(k)  +  NFA(k) 


for  each  k . 


We  may  visualize  our  generic  sensor  measurement  generating  mechanism  as  the  following 
four-step  processes,  for  each  k: 


(1)  Detection:  For  each  nonnegative  integer  n ,  let  D(n  )  be  the  collection  of  all  the  subset 

of  {1, . /i}.  D (n)={0)  when  n=0.  Apparently,  NT=n  implies  Ior{k)€^(n)  for  each  k. 

Then  the  detection  mechanism  can  be  specified  by  the  detection  probability  function  defined  by 

Pd(D  \X/xJ<)  »  Prob.  {IDT(k)=D  \x(tk)=XJJT  =n  }  (16) 


for  each  (D/^)GUD(n)xXnx{n}  and  each  k.  When  D£D(n),  the  right  hand  side  of  (16)  is 

n-0 

0. 


13  \  is  the  set  subtraction  operation,  i.e.,  for  two  sets,  A  and  B  ,A\B  ={a£A  |a£B). 

14  They  have  also  many  other  names  such  as  false  returns,  clutters  (in  case  of  radars),  nuisance  targets 
(not  real  targets),  etc. 
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(2)  Number  of  Measurements:  Because  of  relationship  (15),  given  the  number  NDT(k)  of 
detected  taigets  in  data  set  k,  the  number  N^{k)  of  measurements  is  determined  by  the  number 
NFA(k)  of  false  alarms.  Its  statistics  can  be  specified  by  the  number -of -false  alarm  probability 
function, 


P s, Jin  \D  J(  ft  ft)  =  Prob.  {Nfa  (k )=m  \lDT  (k)=D  J  (t  k)=X  =n}  (17) 


for  every  (mJbJC/i  )€Z+x  ((J  D(n)xXnx{n})  and  for  each  k .  When  D£D(n  ),  the  right  hand 

n-0 

side  of  (17)  is  arbitrary. 


(3)  Random  Assignment :  According  to  Assumption  7,  given  the  set  IDT  (k )  of  detected  tar¬ 
gets  and  the  number  NM(k)  of  measurements  for  each  data  set  k,  the  assignment  of  the  origin  of 
each  measurement  is  totally  random.  In  other  words,  every  possible  realization  of  assignment 
Ak  is  equally  possible.  For  any  pair  (/  J )  of  finite  sets,  define 


A(IJ)  =  {a  -J— *J  |a  is  one— to— one.}  . 


(18) 


Then  #  (A  (I  J  ))  is 
any  k, 


'(#  when  and  0  otherwise- 


Therefore  we  have,  for 


Prob.  {Ak=a  | NM(k)=mJDT(k)=DX(tk)=XflT=n)  =  (19) 

m ! 


for  every  (D  /i  )€UD(n)xXnx  {n},  every  m>#(£>)  and  every  fl€A(£),{l . ,m}).  When 

n-0 

a  £A(D  ,{1, . fn  }),  the  left  hand  side  of  (19)  is  of  course  0.  It  is  arbitrary  if  m  <  #  (D ). 


(4)  Measurement  Values:  Through  the  above  three  steps,  we  have  determined  which  tar¬ 
gets  are  detected,  the  number  of  measurements  and  the  origin  of  each  measurement.  The  last 
step  is  to  assign  some  measurement  value  to  each  measurement  index  y  £•/*✓(*)  being  condi¬ 
tioned  by  (X(tk)J4T),  NM(k)  and  Ak.  By  Assumption  8,  we  specify  this  step  by  the  measure¬ 
ment  value  probability  density  function  defined  by 


Pm  (y  \o/nj: /if.  )psmJdy )  =  Prob.  {y  £dy  \Ak  =a  ( k  )=m  JC  (/*  )=X  AY  =n  }  (20) 


OO 

for  every  {X  ft  )€  U  X„  x  {n  },  every  (a  fn)  such  that  a  €A(D  ,{1  ,...fn  })  and  m>#  (D)  for  some 

n  - 1 
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DC{\,-..fi  },  and  every  y  £(YSt)m  ,  where  /i™  is  the  m-tuple  direct  product  measure  of  pSt. 


Thus  our  generic  sensor  model  is  specified  by  detection  probability  function  PD,  number- 
of-false-alarm  probability  function  PN  and  measurement  value  probability  density  function 

P.u- 


Remark  2:  We  should  note  that  that  the  above  three  functions  defining  the  generic  sensor 
model  in  (16),  (17)  and  (20)  are,  in  general,  all  conditioned  by  the  target  system  state  at  time  tk 
when  each  data  set  k  is  generated.  In  particular,  (16)  is  not  the  joint  probability  of  target 
detection  but  is  a  conditional  probability  of  target  detection  conditioned  by  the  target  system 
state.  Therefore,  no-measurement  data  set  (8,0jkjk),  i.e.,  "sensor  sk  is  up  at  time  tk  but  does 
not  observe  anything,”  is  at  least  potentially  as  informative  as  other  data  sets  with  positive 
numbers  of  measurements.  For  example,  for  a  sensor  monitoring  radio  communication  of  a  tar¬ 
get,  the  probability  of  detecting  the  target  is  zero  if  the  radio  equipment  of  the  target  is  turned 
off.  Thus  the  absence  of  any  reception  while  the  sensor  is  up  may  provide  vital  information  on 
the  state  of  a  target  in  some  cases.  In  another  example,  an  MTI-type15  radar  may  detect  targets 
only  when  the  absolute  values  of  targets’  radial  velocities  exceed  a  given  threshold.  In  such  a 
case,  the  absence  of  radar  return  around  the  position  where  a  target  is  expected  to  exist  may  be 
a  valuable  information  indicating  the  targets  stops  somewhere  in  the  vicinity  of  that  position.  It 
should  also  be  noted  that,  in  our  formulation,  every  data  set  contains  number-of-measurement 
information,  and  hence,  every  sensor  is  a  type-1  sensor  in  Reid’s  terminology  in  [4],  A  type- 2 
sensor  in  his  terminology  is  a  sensor  which  generates  data  sets  each  of  which  contains  at  most 
one  measurement,  i.e.,  NM(k)<  1  for  all  k,  and  is  not  treated  separately  in  our  formulation. 


Remark  3:  For  sensor  systems  with  measurement  time  delays  dependent  on  the  target 
state  (e.g.,  acoustic  sensor  systems  described  in  (9j),  a  simple  model  where  the  target  state  is  a 
pair,  (posit  ion  .velocity),  may  violate  Assumptions  3  and  4,  and/or  other  assumptions.  In  such  a 
case,  in  order  for  our  formulation  to  be  applicable,  careful  modeling  is  necessary  so  that  all  the 
assumptions  made  so  far  are  valid  at  least  in  an  appropriate  sense  of  approximation.  Although 
Assumptions  5  and  6  are  quite  standard  in  the  multitarget  tracking  literature,  their  relaxation 
may  be  required  in  some  applications.  Assumption  5  may  be  relaxed  by  assuming  a  random  col¬ 
lection  of  subsets  of  target  index  set  IT  and  by  considering  each  of  such  subsets  as  a  potential 
origin  for  a  measurement  for  each  data  set.  Recently  an  algorithm  which  allows  measurements 
to  be  merged  has  been  developed  in  [10].  At  this  moment,  however,  it  is  not  clear  how  to  relax 
Assumption  6.  In  this  report ,  we  will  not  attempt  to  relax  these  assumptions.  Another  way  to 
state  Assumption  7  is  that  a  data  set  is  the  unit  of  sensor  data  in  which  the  order  of  measure¬ 
ments  in  it  does  not  contain  any  information  about  the  targets.  In  some  cases,  however,  the 
order  of  measurements  from  a  set  of  returns  may  contain  information  about  targets;  Such  may 

15  MTI  -  Moving  Target  Indicator 
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be  a  case  when  a  radar  uses  a  particular  scanning  pattern.  In  such  a  case,  the  data  set  should 
be  further  divided  to  the  point  where  the  measurement  order  does  contain  any  significant  infor¬ 


mation  about  targets. 


Finally,  in  order  to  reflect  the  random-set  nature  of  targets  correctly,  we  must  require  the 
following  interchangeability  assumption  in  addition  to  Assumption  2. 


Assumption  8:  (Target  Interchangeability  (2))  Detection  Probability  function  PD,  number- 
of-false-alarm  probability  function  and  measurement  value  probability  density  function  PAf 
are  all  invariant  under  target  permutation,  i.e.,  PD(n(D)\Tl(X)/i),  P^FA(m  | n(D ),ri(/f )ji )  and 
(y  \a,/n  ,U(X)/i)  are  all  invariant  with  respect  to  any  n-target  permutation  homeomor- 
phism  n  induced  by  a  permutation  it  on  },  where 


n(D)  =  {n(i )|»  gD} 


and,  when  a  £A(7  J ),  a  w:ir(Dom  (a  ))— *7  is  defined  by16 


fl,(i)  =  (a.rr"1XO  =  fl(^  ’(/)) 


for  all  i  £t:(Dom  ( a  )). 


Remark  4:  We  may  characterize  the  two  target  interchangeability  assumptions.  Assump¬ 
tions  2  and  8,  as  target  without  a  priori  identification,  although  the  use  of  term,  "identification," 
may  well  be  controversial  since  it  is  used  in  other  areas  of  control  theory  with  very  distinctive 
meaning  such  as  system  identification,  parameter  identification,  identification  of  weighting  func¬ 
tions,  etc.  Suppose  that  the  total  number  of  targets  is  n  and  that  there  are  m  detected  targets 
(of  course  m<n).  Then,  with  the  interchangeability  assumptions,  any  one  of  n  V(n  — m)!  possi¬ 
ble  ways  of  associating  the  m  sets  of  data  to  the  n  targets  is  equally  likely.  This  means  that 
the  targets  are  not  discriminated  from  each  other  Q  priori  or  they  are  not  identified  a  priori.  On 
the  other  hand,  if  each  target  index  has  a  specific  meaning  identifying  a  specific  target,  we  may 
say  the  targets  have  a  priori  identification.  This  concept  of  target  identification  should  also  not 
be  confused  by  "target  classification.”  Almost  all  the  "real"  targets  have  unique  identification, 
such  as  name,  number  plates,  manufactory  serial  numbers,  etc.  However,  in  a  target  model 
where  the  targets  do  not  have  a  priori  identification,  such  identification  is  not  considered  within 
the  model.  Therefore  the  targets  are  labeled  or  discriminated  with  each  other  only  through 
measurement  indices.  In  such  a  model,  the  target  classification  is  to  determine  the  targets' 
discriminants  as  much  as  possible  and  the  ultimate  classification  is  to  determine  the 

16o  is  the  function  composition  operator  and  tt-1  is  the  inverse  function  of  rr. 
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identification  of  each  target  by  means  of  a  name,  a  registered  number,  etc.,  whenever  it  is  possi¬ 
ble.  We  will  see  more  discussions  on  this  subject  in  the  next  section  as  well  as  in  a  part  of  Sec- 
t  ion  6. 


3.  TRACKS  AND  HYPOTHESES 

Although  tracks  and  hypotheses  are  among  most  frequently  used  terms  in  the  multitarget 
tracking  literature,  they  arc  not  often  explicitly  defined.  Our  definitions  of  tracks  and 
hypotheses  introduced  in  this  section  closely  follow  Morefield’s  notation  in  [11]  but  differ  in  one 
crucial  aspect,  namely  separation  of  the  measurement-value  information  from  the  number-of- 
measurement  information  in  each  data  set.  In  other  words,  we  will  form  tracks  and  hypotheses 
on  the  set  of  measurement  indices  rather  than  directly  on  the  collection  of  data  sets. 

First  we  will  remove  the  measurement-value  information:  For  the  moment,  let  Z  be  the 
cumulative  set  up  to  K,  i.e., 


Z  =  {z(k)\l<k<K} 


(21) 


Then  we  call 


J  «  U  A,  (*)*{*} 

km\ 

-  U  {1, . 


the  cumulative  measurement  index  set  at  K  (or  up  to  K  or  associating  the  cumulative  data  set 
Z).  Then  the  cumulative  data  set  Z  represents  all  the  information  available  up  to  and  including 
the  most  recent  data  set  z(K).  Every  (j  Ji)  in  the  cumulative  measurement  index  set  J 
represents  the  y-th  measurement  in  data  set  k  generated  by  sensor  sk  at  time  tk.  Define  a  ran¬ 
dom  collection  of  subsets  of  J  by 


A  =  {  {(j  Ji)\i  =Ak(i )  and  \<k<K  }  1 1  <i  <NT  }  \  {0} 

=  {Tk(i )  &  {(J  *)\j  =Ak(i )  and  \<k<K  }*0  1 1<»  <NT  } 
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Each  member  of  A  is  a  set  of  measurement  indices  which  are  included  in  the  cumulative  meas¬ 
urement  index  set  J  and  represent  all  the  measurements  originating  from  a  target  indexed  by  an 
/  •  Since  each  Ak  is  a  random  one-to-one  function  (corresponding  to  the  no-split /merged- 

measurement  assumption,  i.e.,  Assumptions  5  and  6),  it  follows  from  definition  (23)  that  A  must 
satisfies  the  following  conditions: 

(i)  Each  member  of  A  is  not  empty  and  contains  at  most  one  measurement  index  for  each  data 
set . 

(ii)  Any  two  members  of  A  do  not  share  a  common  measurement  index  at  any  data  set. 

Then  we  call  any  possible  realization  X  of  A  a  data-to-data  association  hypothesis  or  data- 
to-data  hypothesis  or  simply  hypothesis  at  K  or  on  the  cumulative  data  set  Z.  Let  the  set  of  all 
the  data-to-data  association  hypotheses  at  K  be  denoted  by  H(Z).  On  the  other  hand,  any  sub¬ 
set  r  of  cumulative  measurement  index  set  J  is  a  possible  "trace"  of  a  target  (detected  in  at  least 
one  data  set  up  to  and  including  K)  and  called  a  track  at  K  or  on  cumulative  data  set  2  if  it 
contains  at  most  one  measurement  for  each  data  set  up  to  and  including  K .  The  set  of  all  the 
tracks  on  cumulative  data  set  2  (or  at  K)  is  denoted  by  T (2).  Namely,  we  have 

T (2)  =  {rCJ  |#(r|A)<l  for  aU  k  such  that  l<k<K}  (24)17 

and 

/  ,7-ifV2=0  for  all 

H(2)  -  {XCT(Z)\{0>  |(,1>Tj)€XxX  such  lha,  ri„J  ■  <!5> 

In  (24)  and  other  subsequent  equations,  we  define 

r\k  =  {/€Z+ !(/,*)€'}  (26) 

for  any  rC2Z+x  Z£+.  We  should  note  that  the  empty  collection  0  is  included  in  H (2).  We  call 
such  hypothesis  null  hypothesis,  which  hypothesizes  that  all  the  measurements  in  cumulative 
data  set  2^  are  false  alarms.  Also  T(Z)  includes  the  empty  set  0  which  we  call  null  track  and 
hypothesizes  a  target  not  detected  in  any  of  data  sets  up  to  and  including  K . 

Given  a  cumulative  data  set  2,  each  XgH(2)  hypothesizes  the  following  events: 

(1)  There  are  #  (X)  targets  which  have  been  detected  and  generated  at  least  one  measurement 
in  at  least  one  data  set  up  to  and  including  K .  This  means  number  NT  of  targets  is  at 
least  #  (X). 

(2)  Each  track  r  in  X  corresponds  uniquely  to  a  target  which  have  been  detected  at  least  once 
up  to  and  including  to  K  . 

17  The  definitions  (24)  (or  (25),  respectively)  can  be  easily  ©ttended  to  the  set  of  all  the  hypotheses  (or 
tracks,  resp.)  on  an  arbitrary  cumulative  data  sel  rather  than  Z  defined  by  (21). 
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(3)  (j  A)€t  for  a  track  r  in  X  means  that  the  j  -th  measurement  in  data  set  k  originates  from  a 
target  identified  by  track  r. 

(4)  For  any  k<K  and  t£\,  r|&=0  means  that  the  target  identified  by  r  is  not  delected 
(falsely  dismissed)  and  hence  generates  no  measurement  in  data  set  k. 

(5)  ./\(UM  is  the  set  of  all  the  measurement  indices  of  false  alarms  in  ail  the  data  sets  up  to 
and  including  to  K  . 

It  follows  from  the  above  discussions  and  the  definitions  A  that 


Prob.  {A€H(Z)  |Z}  =  Prob.  {A€H(Z)  |/> 

=  E  Prob.  {A=X  \Z) 

>.<=H(Z) 

=  E  Prob.  {A=X|/> 

X€H(Z) 


which  means  that  set  H(Z)  of  all  the  hypotheses  on  cumulative  data  set  Z  is  a  mutually  distinct 
and  collectively  exhaustive  set  of  "explanations"  of  the  origins  of  all  the  measurements  in  cumu¬ 
lative  data  set  Z.  We  should  note  that,  in  order  to  generate  all  the  hypotheses  in  H(Z),  we  only 
need  cumulative  measurement  index  set  J  or  equivalently  number-of-measurement  sequence, 

.  The  objective  of  the  rest  of  this  report  to  evaluate  each  hypothesis,  i.e.,  to 

calculate  its  a  posteriori  probability  Prob.  {A=X  |Z}  and  to  infer  the  target  system  state 
(X (tk)fl T)  under  each  hypothesis.  We  need  a  few  preparations  before  the  next  section  in  which 
the  main  result  of  this  report  is  stated. 

First  define  a  random  subset  of  target  index  set  I r  by 


/  =  U  Dom(Ak)  =  U  fDT(k) 


which  is  the  set  of  indices  for  all  the  targets  detected  at  least  once  in  a  data  set  up  to  and 
including  K  and  is  called  cumulative  detected  target  index  set.  Then  definition  (23)  of  A  implies 
#(/)=#( A).  Although  A  "explains"  all  the  origins  of  measurements  in  cumulative  data  set  Z, 
it  does  not  tell  us  which  track  comes  from  which  target.  In  other  words,  the  target  labeling  by 
{l,..//r}  is  lost  in  the  set  operation  in  (23).  This  uncertainty  can  be  represented  by  another 
random  function  Q  which  is  defined  on  A  and  taking  values  in  If.  For  each  r  in  A,  Q(r)  is 
defined  as  an  i  such  that 


r  =  {( j  rk)€-T  I  j  =Ak(i )  and  1  <k<K} 
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Since  each  /l*  is  a  random  one-to-one  function,  such  an  /  is  unique,  and  hence,  ft  is  well  defined. 
Moreover  any  realization  of  ft  is  also  one-to-one  and  /;n(ft)=7.  We  call  any  realization  of  ft  a 
target-to-track  hypothesis  on  Z  and  any  realization  (u,X)  of  (ft, A)  a  target-to-data  hypothesis  on 
Z.  Let  \V(X/j)  be  the  collection  of  all  the  target-to-track  hypotheses  under  assumptions  A=X 
and  Nr  =rt ,  i.e., 

W(X,n )  =  {w  :  X— }  |lj  is  one  —  to—  one.  }•  .  (30) 

We  should  note  that  pair  (ft, A)  is  nothing  but  another  form  of  representation  for  the  assignment 
tuple  (Ak)k_v  This  is  so  since,  for  each  k ,  we  have 

Dorn  (Ak )  =  IDT  (k )  =  {i  elm  (ft)  |  fr\i )  |  A  *0}  (31 ) 


j  =  Ak(i)  if  and  only  if  {/  }  =  ft_1(/ )  |A  . 


As  a  consequence  of  our  target  interchangeability  conditions,  we  have  the  following  important 
lemma: 


Lemma  1:  For  each  X£H (Z),  for  each  n  >#  (X)  and  each  w£W  (X/i ),  we  have 


Prob.  {ft=w  |  h—\fiT  =n  }  =  #  (W  (X/t  J)’1  =  . 

n ! 


Proof:  Since  W(X^a)  contains  till  the  target-to-track  hypotheses  when  conditioned  by  A=X 
and  Nj=n  ,  we  have 

S  Prob.  {ft=w  |  A  =X  =n  }  =  1  .  (34) 

wew(x^)  ' 

On  the  other  hand,  if  Wi€W(X,n)  and  W2€W(X/j),  there  exists  a  permutation  tt  on  {1,../j  }  such 

n  ® 

that  or  Wi(f)=tr((j2(f))  for  ah  rgX.  Since  #(W(X/j))=- - ,  to  show  (34),  it  suf- 

(n  -#  (X))! 

fices  to  show  that  Prob.  {ft=s  w  |  A=X/7r  }  or  equivalently  Profc.  {ft=a  w,A  =X|Nr}  is  invariant 
under  any  permutation  7r  on  {l,.../i}.  Since  (ft, A)  is  one-to-one  to  (Ak)km  1(  there  exists  (a*)/*.) 
such  that 


k 

| 


ft 

8 

❖ 

I 


i 

8 


v T^frrwvFv^i  nnn 


Section  3 


Ktay  12,  1988 


{ft-w,A=X}  = 

K  -  i 


TR- 1048-01 


Thus,  using  Assumption  4  (Conditional  Independence),  we  have 


Prob.  {Q=u,A=\  |  =n  } 


=  f  Prafc{n=WfA-X|(*(/*))&i-to^^^  |^r=«} 

K»1 

x. 

=  f  Prob.  ( n  {Ak=ak}  | (JV (tk))k-i=(Xk)k-\’NT  ~n  )  Prob.  {(X (tk))km\€f\dXk  \NT=n} 

/c-I  Ar— 1 

X. 

-  f  ft  Prob.  {Ak  =ak  \X(tk )=Xk  -n  }  Prob.  {(X (/*  ))£  i6  ft dXk\NT=n) 

J  k-\  ktm  1 


Using  (16),  (17)  and  (19),  we  have,  for  each  k , 


Prob.  {Ak=ak  \X (tk)=Xk  J4T=n}  =  PD ( Dom (ak)\Xk/i) 


(  £  - - ”?L  (m  |  Dom  (ok)JCk  si )) 

m  •max {Im  (at)}— #  (/m  (at))  (m  +#  (7/71  (fl*  )))! 


Therefore,  it  follows  from  the  target  interchangeable  assumptions,  Assumptions  2  and  9,  that 
Prob.  {fl=s  w,A=X  \Nt  =n  }  is  invariant  with  respect  to  7r.  Q-E.D. 


This  lemma  is  very  important.  First  this  lemma  states  that  we  do  not  have  to  evaluate 
every  target-to-data  hypotheses  (oj(A)  instead  evaluation  of  every  data-to-data  hypothesis  is  suffi¬ 
cient.  Secondly  this  lemma  gives  a  hint  as  to  what  should  be  the  appropriate  set  of  variables  to 
propagate  forward  when  we  evaluate  each  hypothesis  recursively.  The  importance  of  this  choice 
may  be  illustrated  in  the  following  very  simple  example: 


Example  1:  Assume  Prob.  {NT  =2}-l,  X2={5i,52},  and  targets  are  stationary  and  i.i.d.. 


Qo2(E)=  £  fl(i,)?(«2) 

(' 


with  some  q  :  Xi— 1 >(0,1)  such  that  q  (S  {)+q  {S  i)=l .  Suppose  we  have  a  data  set 


Pv 

I 

ft 


",‘.VWV • 
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-(1)=(5'i,52^/ i^i)  from  a  sensor  Sj  at  time  /j>/0  w''h  2}  | A' ^2,1  )=1 , 

PnFa(Q  \DJf2,l)=l,  and  PM (XaWrXa{2) \a  ,2,(A'i,A'2),2,l)  =  3,  where  yj={51^S:}.  Namely,  the 
probability  detection  is  one,  no  possibility  of  false  alarms  and  the  measurement  error  is  zero. 
Let  Z={c(l)}.  Then  apparently  Prob.  {\-\  |Z}=1  where  X={r,,r2},  r,={(l,l)}  and  r2={(2,l)}. 
Then,  by  Lerv.ma  1,  Prob.  | A  =X,,Vr  =2}- V2  for  each  one-to-one  function  u  from  X  to 

{1,2}.  Thus,  we  have 


Prob.  {X(t  xMSiJSz) \Nt=2,A=XZ}  =  Prob.  {X(t  ,)=(S2^,)  |Nr=2,A=X,Z}  =  V2  . 


Namely  conditional  distribution  of  targets  is  no  longer  independent  although  the  a  priori  distri¬ 
bution  is  independent.  This  is  a  direct  consequence  of  our  assumption  of  no  merged  measure¬ 
ment,  i.e,  if  a  target  is  at  Sj  the  other  target  must  be  at  S2,  and  vice  versa.  The  mixing  of  the 
target  state  distribution  with  all  the  realizations  of  fl,  which  are  all  equally  likely  because  of  the 
target  interchangeability  assumptions,  thus  turns  the  independent  a  priori  distribution  into  a 
posteriori  distribution  with  cross-correlation.  In  this  particular  example,  the  a  posteriori  distri¬ 
bution  is  still  interchangeable.  In  general,  however,  this  kind  of  mixture  may  create  very  compli¬ 
cated  a  posteriori  distributions.  On  the  other  hand,  if  we  conditioned  the  state  distribution  by 
one  of  the  equally  probable  realization  of  Cl,  then  the  a  posteriori  distribution  is  still  independent 
although,  in  this  particular  example,  the  distribution  is  degenerated. 


This  example  suggests  that 


Prob.  {X(tx)edX  | NT=n  ,A=X,Z> 

=  £  Prob.  {X(t  x  )£dX  \NT=n  ,n=^,A=X^>  Prob.  {CP=u\Nt  =n  ,A=X,Z} 

w€W(X/.) 

=  £  Prob.  {X {tK )edX  \Cl=uflT=n  ,A=X,Z) 

nl  wew(X/i) 

may  not  be  in  an  appropriate  set  of  variables  to  propagate  forward  when  we  evaluate 
hypotheses  recursively  and  infer  the  target  state  (X  fl?)  accordingly.  We  may  say  the  above 
mixture  is  an  unnecessary  "over-aggregation"  of  information.  For  this  reason, 


Prob.  {X(tK)£dX  |n=w,A=X,Nr,Z} 


is  more  appropriate  as  shown  in  the  next  section. 

Before  we  close  this  section,  we  will  investigate  the  relationship  among  tracks  and 
hypotheses.  For  this  purpose,  we  consider  all  the  cumulative  data  sets  and,  to  explicitly 

17 
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represent  the  difference,  we  denote  the  cumulative  data  set  up  to  K  by  Z^K\  The  corresponding 
cumulative  measurement  index  sets  are  denoteed  by  J^K\  Then  such  cumulative  sets  (and 
aimulative  measurement  index  sets)  are  finitely  or  at  most  countably  many.  Then  apparently 
we  have  the  inclusion  relationship, 


T  (Z(Jf|))  C  T  (Z(*^) 


and 


H(Zikz})CT(Z^:') 


for  all  Redefine  Z  as  the  maximum  cumulative  data  sets,  i.e.,  (J z (k)=  [J  Z^k\  Then, 

*>t  *>1 

for  any  track  r£T(Z),  we  call  rfV^  the  restriction  of  track  r  on  Z^T.  For  any  hypothesis 
XgH(Z),  we  call 

X|Z^  =  {rny(i)|r€X>\  {0} 

the  restriction  hypothesis  X  on  Z^k\  When  a  track  rj  is  a  restriction  of  r2,  we  say  r,  is  a  prede¬ 
cessor  of  r2,  or  equivalently,  r2  is  a  successor  of  rv  This  predecessor/ successor  relationship  is 
defined  on  H(Z)  through  the  above  restriction  operation  in  the  same  way.  Then  each 
predecessor/ successor  relationship  defined  a  partial  order  on  each  of  T(Z)  and  H(Z).18  An 
immediate  predecessor  of  any  track  or  any  hypothesis  is  unique  and  called  a  parent  (always  sin¬ 
gle).  Therefore  both  T(Z).and  H(Z)  are  arborescent  (tree-like)  with  such  partial  orderings,  i.e., 
the  set  of  predecessors  of  any  element  is  totally  ordered.  This  justifies  the  commonly  used  ter¬ 
minology  such  as  hypothesis  tree  or  track  splitting. 

We  conclude  this  section  by  the  following  remark: 


Remark  5:  According  to  Morefield’s  notation  in  [ll]19,  a  track  is  a  subset  of  a  cumulative 
data  set  Z  rather  than  a  cumulative  measurement  index  set  J  as  in  our  formulation.  When  a 
track  is  defined  in  such  a  way,  it  include  measurement  values,  and  hence,  the  a  priori  probability 
distribution  or  its  density  of  a  hypothesis  (as  defined  as  a  possible  collection  of  tracks)  is  very 
hard  to  calculate  and  may  not  be  well  defined.  For  example,  the  validity  of  the  following  two 
equations20  is  questionable: 


N, 


3 


J 


D 


P  (X)  =  exp(-i^  )  (uFA  )  fa/NFa 


where  uFA  is  the  expected  number  of  false  alarms  and  NFa  is  the  hypothesized  number  of  false 
alarms,  and 


18  The  partial  order  on  T(Z)  is  merely  the  set-indusion  partial  order. 

19  In  Morefidd’s  notation,  r  is  a  hypothesis  and  X  is  a  track. 

:o  Each  of  these  equation  is  in  [11]. 


18 
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where  I r  is  the  length  of  track  r  and  L  is  the  expected  track  length.  We  will  present  an  alterna¬ 
tive  batch-processing  type  formulation  in  Section  6  and  a  possible  application  0-1  linear  pro¬ 
gramming  techniques. 


4.  GENERAL  RESULTS 

This  section  describes  the  main  result  of  this  report,  i.e.,  the  recursive  hypothesis  evalua¬ 
tion  formula.  All  the  assumptions  made  in  Section  2  will  be  used.  However,  we  should  note 
that  we  have  made  very  general  assumptions  which  allow  dependence  among  targets  although 
the  target  interchangeability  is  still  crucial  to  our  derivation  of  the  general  algorithm.  To  expli¬ 
citly  specify  cumulative  data  sets,  we  denote  cumulative  data  set  up  to  k  by  Z^k\  and  accord¬ 
ingly,  the  cumulative  measurement  index  set  up  to  k  by  J^k\  Instead  of  (23)  and  (29),  we  use 

A*  ={{(A^)^)’|/€/DrW’and  1  <k<k}  |l<»<Av}\  {0} 


n*(r)=i  if  and  only  if  T  —  {(j  ,A)E/  I)  =A*(< )  and  l<k<k} 


Each  hypothesis  on  Z ^  can  be  evaluated  recursively  according  to  index  k.  With  usual 
notational  abuse  of  P ,  we  have 

for  each  k.  Since  P  (Z.^  \Z^k~^)  is  the  normalizing  constant,  the  hypothesis  evaluation,  i.e..  the 
calculation  of  P(A*  |Z(k)),  can  be  accomplished  by  calculating  P(Z(*\A*  |Z(*_,),A*_i)  if 
P(A*_1  |Z(*-1^)  is  known  under  the  recursion  assumption.  This  is  done  by  considering  all  the 
factors  involved  in  this  term,  i.e.,  the  sensor  models  specified  by  the  detection  probability  func¬ 
tion  Pp,  the  number-of-false-alarm  probability  function  PN  ,  the  measurement-value 
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probability  density  function  P.m  >  1  he  target  system  state  estimate  based  upon  the  previous 
cumulative  data  set  Z^k~l\  and  at  last  most  importantly,  by  random  assignment  of  measure¬ 
ments  and  target  interchangeability. 

To  accomplish  the  recursion,  we  must  chose  an  appropriate  set  of  variables21.  We  should 
have  a  complete  set  of  information  from  which  we  can  calculate  P(Z^k\\k  while  we 

wish  to  minimize  the  set  of  variables  which  we  must  carry  forward.  Taking  the  consideration 
made  at  the  end  of  the  last  section  into  account,  the  following  set  of  variables  to  be  propagated 
is  chosen:  For  each  k>  0,  we  define 

P^k)(\\Z^)  =  Prob.{\(k)=\\Z{k)}  ,  (36) 

P$Hn  |X,Z<*>)  =  Prob.  {Nt  =n  |A (*)=X,Z<*>)  ,  and  (37) 

Pkk)(dX  \u/i  ,\^k))  =  Prob.  {X(tk)£dX  |n*=u >flT=n  ,A (k)=\^%  .  (38) 

for  each  X£H(Z^),  each  n  >#  (X)22,  and  some  Only  one  ui  is  necessary  because,  in 

an  appropriate  sense,  (38)  is  invariant  with  respect  to  any  u£W(\/i),  as  shown  in  the  following 
theorem.  The  evaluation  of  such  an  u  is  also  unnecessary  thanks  to  Lemma  1  in  the  previous 
section.  The  following  theorem  is  the  main  result  of  this  report.  For  this  theorem,  we  define 

Z(O)=0, 

PkO)(0 10)=1  ,  (36’) 

P$  (n  1 0,0)  =  Prob.  {NT  =n  }  and  (37’) 

Pk0)(dX  |0/z  ,0,0)  =  Qn0{dX)  .  (38’) 


Theorem  1:  (General  Result)  For  each  k>0,  let  Z=Z(-k')  and  Z=Z<-k~1\  Then ,  for  each 
X£H(Z),  when  \  is  the  unique  parent  of  X  on  Z,  we  have 

[1]  (Target  System  State  Estimate  Updating)  for  any  n  >#  (X)  and  any  (\/i ), 


21  The  variables  (or  more  appropriately  called  "functions")  which  are  to  propagate  forward.  They  may  be 
called  filtering  states  or  saffictent  statistics. 

22  By  the  definition  of  A* ,  Prob.  {Nt  <  #  (A*)}=0  for  any  k . 
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k-/i  ,x,z)  =  cj?Xz(k)\u,\/i  zr 

P.U  (O',  (*  )),*"/* }  I  a  ^  (A  U/J.A)  P^  (NM(k)  -nD(\  \k)\D X /i  Jt) 

Pd(D  )X/iJi)  J  F^JdX  \X)Pik-lHdX\u*  XZ) 
x„ 

w/i^re  Cx  ij  the  normalizing  constant  for  (39)  and  defined  by 

CgXiyjnJkf^W^/i  2)  = 

J  Pm( y\a,mX/iJi)  PNFA(m  -nD(\ \k)\D  fX  yt  Ji) 
x, 

Pd{D  \X/iJt)J  F.^JdX  \X)Pjt-»{dX\*/i£2) 


(39) 


(40) 


for  each  (y/n)£  U  (Y,  Y"x{m},  nD(k  I*)  w  the  number  of  targets  which  are  hypothesized  (by  X) 


m  — 0 


to  be  detected  in  data  set  z  (k ),  i.e., 

Mx  l*)«  E#(tI*)  , 

rex 

and  Zj£\\  (X/i  ),  D  €D (n  )  and  a  £A(D  J M  (k ))  are  uniquely  defined  by 

cj(T)=u)(rry for  every  r£X  such  that  7 ■fV(*_1V0 
D  =  Dom  (a  )  =  {» e/m  (w)  J  (arty  )  j£ )  ^0}  and 
{a  (i )}  —  ar1(/)|A  for  all  i  €Z)  , 


(41) 


(42) 

(43) 

(44) 


{2]  (Target  Interchangeability)  for  any  n>#  (X),  C$\  |w,X/7  Z)  defined  by'  (40)  is  invariant  for 
all  cj£\V (Xyj  f  and  moreover,  for  any  u>£W (\/i)  and  any  permutation  it  on  (1  ,....yt  }, 


PitHdx  \u/i  ,\,z)  =  p$Hn(dx)M  ,\z) 

if  u-is  u  and  n  :  X„  —*Xn  is  the  n-target  permutation  homeomorphism  induced  by 
[3]  (Number-of  Target  Estimate  Updating)  for  any  n  >#  (X), 
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Pir}(n  !  X  ,Z )  =  C0;  (/: )  |  X  ,Z )~ 1  C kk)(z (k )  1  U,.\ /,  1 ) (n  |X,Z)  (46) 

( n  -#  (X))! 

where  C&  is  the  normalizing  constant  defined  by 
C$TX(y,mjkjk)\  X,Z)  = 

CO  -  (47) 

£  C^((y,m/^)  M/t  JZ)  £  ^  PtfT-l)(n  |X,Z) 

n-#(X)  (rt-#(X))l 

oo 

/or  eac/i  (y,m)€  U  0'itT  x  {m}  and  u  is  an  arbitrary  target -to-track  hypothesis  in  W  (X/t ), 
and 

[4]  (Hypothesis  Evaluation) 


P^(X|Z)  = 


P^CXIZ) 


(48) 


where  P^k\z (k)\Z)  is  the  normalizing  constant  and  Lk(z(k)^  \  Zf)  is  the  likelihood  of  (z(k),\) 
given  (Z,X)  defined  by 


L*((y^n^^),X  |X,Z)  =  Ctf)((y,rn,/*,s*)  |X£)  (49) 

m :  ' 


for  each  (y,m)€  U  (T,./"  x  ("i}  with  nD(\  |/c)  and  C&?  being  defined  by  (41)  and  (47/  respec- 

m  -0  *  r 

lively. 


Proof:  First  we  will  prove  [1]  and  [2]:  To  prove  II],  roughly  speaking  we  follow  the  Baye¬ 
sian  expansion, 


p(dx\nk^T,hkZ)  = 


P(Z,\k,Uk  I  Ak_ltQk_}J{  JVT  fZ) 
P  (Z >hk  ,Qk  \Ak_l£lk_-iflT  fZ) 


P(rtff  |fW/r,A*_„Z) 


abusing  notation  P  .  Therefore  we  must  calculate  P(Z, \k,Qk  \  Ak_lflk_lJ(  fL).  Let  m  and 
n  be  arbitrary  nonnegative  integers,  E  any  measurable  set  in  (Y*)m,  X£H(Z)  any  data-lo-data 
hypothesis  on  Z,  u!^W(\/t)  be  any  target -to-track  hypothesis  compatible  with  (X/j),  X£H(Z) 
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any  data-to-data  hypothesis  on  Z,  and  Zj€YV(X/j)  be  any  target-to-track  hypothesis  compatible 
with  (X,n  ).  Then  a  straightforward  but  rather  iengthy  application  of  Bayes  rule  gives  us  the  fol¬ 
lowing  expansion: 

Prob.  {:{k)£E*-{m  jkrsk},Ak=\fik=u.'  l^t-i~kXlk_l=uXOt)=X  J^T  =>1  Z-} 

S  />r06.{/OT(/k)=D|A*_,=XA-i=w^(/*)=A'^7-=;,^> 

D6D(n  ) 

Prob.  {Nm (k )=m  \IDT (k)=D  ,A*_,=X  flk-l=ZJ(tkyX Z) 

S  Prob.  {Ak=a  \NM{k)=m  JDT(k)=D,\k-\=^fh<-\=uJ({ik)=X yT=n  Z)  (50)23 

•  ex  (D.{1 _  _ 

Prob.  {A*=X  \Ak=a  flM(k)=m  ,A*_1=X^]*_i=i3rX’ (tk )=X  fir  =n  Z) 

Prob.  {nk-u  )Ak-a  fJM{k)=m  VA*=X/1*_,=D^'(/*)=A'  J^T=n  Z) 

Prob.{(yj(k))”ml£E  \Ak=a  flM(k)=m  ,Ak=kJTlk=u>J(Qk)=X  Z) 

The  first,  the  second,  the  third  and  the  sixth  factors  on  the  right  hand  side  of  the  above  equa¬ 
tion  are  given  by  (16),  (17),  (19)  and  (20),  respectively.  Namely,  for  the  first  factor,  we  have 

Prob.  {I or  —P>  |  (tk)=X  T  =n  £ } 

-  Prob.  {IDT  =D  \X (tk)=X  JJT=n}  =  Pd(D  \X /i  ,k) 

since  data  sets  are  conditionally  independent.  For  the  same  reason,  the  second  factor  is  given  by 


Prob.  { NM(k)=m  \ IDj=D  ,A>t_1=X,nt_)=u;rY (tk)=X  ~n  Z} 

=  Prob.  {NM(k)=m  i IDT=DJ(tk)=X&T=n} 
lPNfA(m-#(D)\DJC,nJi)  ifm>#(D) 

10  otherwise 


-{ 


Since  IDT  (k)=Dom  (Ak),  the  third  factor  is  given  by 


Prob.  {Ak  \NM(k)=m  JDT  —D  ,A*_i— X,n*=w,^ {tk)=X  flT=n  £} 


=  Prob.  { Ak  |  Nm  (k  )=m  JDT  =D  ,*  (/*  )=X  JJT=n}  = 


(m -#(£))! 

m ! 

0 


if  Dom  (a  )=D 
otherwise 


23  (50)  is  composed  as  P 0=^2p  xP 2Sp  jP *P  iP e-  By  'he  /  -th  factor  on  the  right  hand  side  of  (50),  we 
mean  conditional  probability  P, . 


In  the  sixth  factor,  as  far  as  data  set  z(k)  is  concerned,  the  condition  by  assignment  hypothesis 
a  and  the  condition  by  target-to-data  hypothesis  (w,X)  are  redundant.  Thus  we  have 


Prob.  {(y,  (*));"- 1€£  \Ak=a  J4s,{k)=m  ,.V*=\,n (tk)=X 
=  /  pm  (O';  (*))"- 1  \a /n  J<  /x  ,k)U  n(dy) )  . 

F  /-I 


On  the  other  hand,  the  fourth  and  the  fifth  factors  merely  check  the  consistency  among  X, 
X,  u,  Zj  and  a  .  The  fourth  factor  is  1  if  X  is  the  unique  parent  of  X  on  Z  and  if 


(51) 


where 

\ dd  =  {tU{(/  *)  U  «*  (c(7))}  |7^}  and 

=  {{(a  O'  )J< )}  I  *  ZDom  (a  )\Im  (S)}  . 


(52) 


It  is  0  otherwise.  In  (51)  and  (52),  X^  is  the  set  of  tracks  which  have  been  already  started  in 
y(k_1)  and  Xnw  is  the  set  of  tracks  corresponding  to  the  targets  which  X  hypothesizes  to  be 
newly  detected  in  data  set  z(k).  The  fifth  factor  is  1  if  w  is  the  restriction  of  cu  to  X  in  the  sense 
of  (42)  and  the  target-to-track  assignment  by  u  is  consistent  with  the  target-to-measurement- 
index  assignment  by  a  ,  i.e.,  it  is  1  if 


<Jj)  =  w(rn/(*"5>)  if  T€Xoid 
t  j*  =  {a  (u(r))}  ifrEKew 


and  0  otherwise.  Therefore,  for  a  given  target-to-data  hypothesis  (w,X),  the  fourth  factor  times 
the  fifth  factor  is  1  if  (<D  £)  fl)  satisfies  (42)  to  (44)24  and  0  otherwise.  Then,  since  a  is  one-to- 
one,  (44)  implies  nD(\  |k)=X#  (r  |k)=#  ( Dom(a )).  Thus  (50)  is  reduced  to 

r£X 


24  In  other  words,  if  (w ,D  a)  is  uniquely  determined  by  (42)  to  (44)  from  given  (wX). 
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(54) 


Prob.  {z  (k  )£E  *  {m ) , A k  =X ,Qk  =uj  |  A* _,=X ,nk  =Zj#  {tk  )=X  Ar  Z } 
=  Pd(D  \X  ^  J<)  PN^(m-nD(\  |A) J<) 


( m-nD(\  | A))! 
m ! 


/  pm  ( (y,  )/”_ 1 1 a  S”  X  *  *■ )  .ft  P,k  {dy, ) 

r  ^  “ 1 


if  m>#(P)  and  (~J)  m)  satisfies  (42)  to  (44).  Otherwise  the  left  hand  side  of  (54)  is  zero. 
Equation  (54)  together  with  "extrapolation"  equation, 

Prob.  {X(tk)edX  1 0*_,=w^r=n  ,Kk^=\Z) 

-  /  (55) 


implies  (39)  with  (40),  which  concludes  the  proof  for  part  [1).  Then  part  (2]  follows  immedi¬ 
ately:  The  second  statement  of  [2]  for  k=0,  which  should  mean  (5)  in  Section  2,  is  true  by 
Assumption  2.  Suppose  the  second  statement  of  part  [2]  is  true  for  it— 1.  Then  ,  by  the  inter¬ 
changeability  assumptions,  Assumptions  2  and  9,  we  have 

p jw ((y^n)  I®  7r“1,m,n(A,)/j  Ji)  PNfA(m  -nD(\  |A) \tr(D),U{X)/i  *) 

pD(n(D)\n{X)/ijc) J  F^mdxiintfVPg-'HnidF)!*^  ,\£) 

x* 


is  invariant  for  any  n-target  permutation  homeomorphism  n  induced  by  any  permutation  it  on 
This  means  that  the  normalizing  constant  C$\-  \u,\/i  £)  is  also  invariant  for  all 
w€W  (X/i )  (the  first  statement  of  part  [2])  and  the  second  statement  of  part  P)  is  also  true  for 
k.  This  completes  the  proof  of  parts  [1]  and  PJ. 

Next  we  will  prove  PJ:  To  do  this,  roughly  speaking,  we  must  calculate 
P{Z(k\ A*|A*_1>tfr>Z(*-,)).  Let  m  be  any  nonuegative  integer,  E  any  measurable  set  in 
(Y,t)m ,  XeH(Z)  any  data-to-data  hypothesis  on  2  and  n  any  nonnegative  integer  such  that 
n  >#  (X).  Then  it  follows  from  a  straightforward  application  of  Bayes  rule  and  from  (40),  (54) 
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and  (55)  that 


V 


Prob.  {:(k)£E*{m  j*a>,a*-X  |  A*_,=XJVr='i  ,Z} 
Prob.{:(k)eEx{m  A*=X,n*=u  |A*_1=XX2*_,=(u»  |X)^/T=n  ,Z} 
Prob.  |X)  |  A  *  _  | = X  tbJ  j  —n  ,Z} 


-  S  Prob.  {Plk_y— (_)  |X)  |  A*_i=X//r  =  n  ,Z) 

/  /V<*.  {;(/c)€£x{m>,A*=XA=w|A*_1=XA-i=(w|X)jr(/4)=A'^r«'»  Z} 

\ 

Prob.  {X(tk)£dX  |n*_j=(u/  \\)flT=n  ,A*_,=A,Z} 

“  ^  Ia*-i=^t r-«  ^>(n,-^|/C))!  /  CjtMy  /n  U  fy  \u±/t  Z)tCk(dy) 


(56) 


where  A  is  the  unique  parent  of  A  on  2  and  (u  |A)£W(A/j )  is  the  restriction  of  u  in  the  sense  of 
(42).  According  to  part  [2]  which  has  been  already  proven,  C^\-  \uZ/i  Z)  is  invariant  with 

n  t 

respect  to  u.  By  Lemma  1  in  the  previous  section,  we  have  #  (W  (A/i  ))= - : - and 

"  (n  -#  (A))! 


Prob.  {fi *.,=D  |  A*_,=A  J1T  =*Z}=  p 

u 

for  all  (A/i ).  Hence  (46)  and  (47)  follow  from  (56),  thus  completing  the  proof  of  [3]. 

For  any  m> 0,  any  measurable  set  £  in  ( Y,k)rn  ,  and  any  X£H(Z),  we  have 


Prob.  {z(k)£E'x{m},\k=\  |  A*_1=Xr^} 

00 

=  X)  Prob.  {z{k)£Ex{m},\k=\\\k_x— \flT=n  JZ}  Prob.  {NT  \\k_x=\Z} 

n  (\) 


(57) 


where  A  is  the  unique  parent  of  A  on  Z.  Therefore,  (48)  and  (49)  follow  from  (56),  (57)  and 
Leinma  1  in  the  previous  section,  completing  the  proof  of  part  [4].  The  reason  why 
Lk(z(k),  A  |Z,A)  is  called  the  likelihood  of  (z{k)Z)  given  (Z,A)  is  clear  in  (57).  Q.E.D. 


An  interesting  observation  of  the  general  result  stated  in  terms  of  Theorem  1  is  the  normal¬ 
izing  constant  of  the  Bayesian  formula  at  each  level,  i.e,  target  system  state  estimation  —*■ 
number-of-target  estimation  -*  hypothesis  evaluation,  which  is  always  used  in  the  next  higher 
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level.  As  the  final  stage  of  this  chain,  we  have  the  following  corollary: 


Corollary  to  Theorem  1:  The  likelihood  of  the  current  data  set  z  (A )  given  the  past  cumulative 
data  set  Zxk~x\  or  the  density  (in  an  appropriate  sense)  of  probability  distribution  of  the  current 
data  set  z  (A )  conditioned  by  the  past  cumulative  data  set  Z(k~x\  is  given  by 

P?X(yjnJkJk)\&k-")  =  E  L*((y^n^^),X  P{hk~x)(\  \ Z(k-‘>) 


for  each  (y  /n  )€  U  (Ys .  )m  *  {m  }  and  each  hypothesis  XgH(Z^)  whose  unique  parent  on  Z(k~l)  is 


The  proof  to  the  above  corollary  is  obvious  from  (48)  and  the  proof  of  part  [4]  of  Theorem 
1.  Theorem  1  gives  us  a  general  multitarget  tracking  algorithm  in  which  the  set  of  variables  to 
propagate  forward  is  (1)  all  the  hypotheses  in  H(Z)  and  all  the  tracks  in  T(Z),  (2)  evaluation 
P of  each  hypothesis,  (3)  number-of-target  statistics  Pf&  given  each  hypothesis  and  (4)  tar¬ 
get  system  stale  statistics  P given  each  data-to-data  hypothesis,  each  number-of-target s 
hypothesis  and  each  target-to-track  hypothesis.  By  imposing  additional  assumptions,  we  can 
derive  many  different  multitarget  tracking  algorithms  from  the  general  result  shown  in  Theorem 
1.  One  of  the  most  interesting  result  is  obtained  when  we  impose  the  independent  target 
assumptions  together  with  Poisson  assumption  on  the  number  of  targets,  which  will  be  discussed 
in  the  next  section. 


5.  I.I.D.-POISSON  CASES 

By  i.i.d.  (independent,  identically  distributed)  -  Poisson  cases,  we  mean  cases  where  several 
additional  assumptions,  the  most  importantly,  the  independence  among  targets  and  Poisson  dis¬ 
tribution  of  the  number  of  targets  are  added  to  our  basic  assumptions,  Assumptions  1-9.  With 
such  assumptions,  the  genera!  result  derived  in  the  previous  section  can  be  greatly  simplified.  In 
short,  in  these  cases,  we  may  evaluate  tracks  and  hypotheses  separately,  and  accordingly,  the 
filtering  equation  described  by  (39)  and  (40)  in  the  previous  section  is  also  decomposed  into  a  set 
of  track -wise  equations.  First  we  will  list  our  additional  assumptions: 
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Assumption  Alt  For  each  positive  integer  n  ,  the  common  component  of  the  target  system 
state  space  is  a  formal  singleton,  i.e.,  Xf^O}25  and  the  space  X'n  for  the  individual  target  states 
is  identically  equal  to  some  hybrid  space  X,  i.e.. 


X„  =  (X)n  =  Xx  ••••*  X  ( n  times) 


For  each  n>  0,  given  Nj=n,  the  first  component  X  of  the  stochastic  process  (X  (t  )/JT ), e|,  x) 

is  a  system  (.t,  )"_j  of  n  time-homogeneous26  Markov  processes  which  are  independent  to  each 
other  and  have  common  initial  distribution 

q0(dx )  =  Prob.  {x,  (/  0)£dx  } 


and  state  transition  probability 


feu  (d*  U)  =  Prob-  &  (f  U  if  )=*  > 


In  other  words,  we  assume 


es(fUi)-  n<?0(^) 

i-i  j-i 


(58) 


and 


=  ilftiidXi  IS) 

i-i  i-i 


(59) 


for  each  n  >  0. 


Assumption  A2:  The  a  priori  distribution  of  number  Nf  of  targets  is  Poisson  with  mean 
u0,  i.e., 


Prob.  {Nt  =n  }  =  exp(-t/0) 


un 

n ! 


23  X„c  will  be  ignored  henceforth. 

26  The  time  homogeneity  assumption  is  being  made  again  only  to  reduce  the  notational  complexity. 
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for  all  n> 0. 


Assumption  A3:  The  detection  of  targets  is  target-wise  independent  and  the  detection  of 
any  individual  target  i  depends  only  on  its  state  ,v, ,  i.e. ,  for  any  data  set  k ,  any  n>  0,  any 
(.x,  )/'_i€X"  and  any  D€D(n  ),  we  have 


pdV>  \(x,)r-i**)=nPD(*<  i*)***!  -Pot*,  j (60 ): 


with  a  common  target-wise  detection  probability  function  pD( ■  |&)  :  X  — *•  (0,1). 


Assumption  A4:  The  number  of  false  alarms  in  each  data  set  k  is  independent  of  target 
states  or  any  other  elements  in  the  data  set,  and  has  distribution  Pfj  (•  |£),  i.e., 


PsAm  \D pc /t  Jc.)  =  pN  (m  | k) 


for  any  (D  pc/i  )GUD(n)xX"x{n}.  Given  the  number  of  false  alarms  in  data  set  k,  the  vector 

n-1 


of  false  alarm  measurement  values  is  a  system  of  i.i.d.  random  elements  in  y,  with  a  common 
distribution  having  a  density  pFA  (•  |£)  with  respect  to  p,t. 


Assumption  A5:  The  error  in  a  measurement  originating  from  a  target  i  in  any  data  set  k 
depends  only  on  its  state  x,  and  can  be  modeled  by  a  common  state-to-measurement  transition 
probability  having  a  density  pM( ■  |-,A)  :Y,k*X  —*■  [0,oo).  Together  with  the  second  half  of 
Assumption  A4,  we  have 


Pm  ((y,  )jZ 1  \asnj),{x,  ),n_„rj  Jc )  =  (H  pM  (ya  (l )  \x,  f  ))•  (  ft  pFA  (y,  I  * ))  (62) 


1  (.Im  (a  ) 


for  each  n  >0,  each  (x,  )"_i€X"  ,  each  D£ D(n),  each  m> 0,  each  a  £A(D  ,{l,..../n  })  and  each 

<yj),ZtG(Y,tr- 


We  should  note  that  Assumptions  A1  -  AS  are  assumptions  which  are  "additional"  to 
Assumptions  1  -  9  made  in  Section  2.  For  example,  equations  (58)  -  (62)  satisfy  the  requirement 


27  For  any  set  A  ,  \(  A  )  is  the  indicator  function  of  A  ,  i.e.,  \(fl  'A  )  is  1  if  a  G4  and  0  otherwise. 
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of  the  target  interchangeability  by  Assumptions  1  and  9.  Before  stating  the  multitarget  tracking 

algorithm  resulting  from  these  assumptions,  we  will  introduce  several  helpful  notations  and 

preparations:  As  in  the  previous  section,  we  denote  the  cumulative  data  set  up  to  k  by  Z ^  and 

the  corresponding  cumulative  measurement  index  set  by  J f-k\  Let  the  maximal  cumulative  data 

set  be  denoted  by  Z,  i.e.,  Z={z  (k)  |/c  >  0}=  U  Z^k\  For  each  k  and  each  track  r£T (Z),  we  will 

k>  0 

associate  a  probability  distribution  on  X.  First,  for  each  k,  we  define 


y[r\k] 


if  r\k  -{/} 
if  r  \k  =0 


(63) 


where  6  is  used  symbolically  to  represent  the  fact  that  track  r  misses  measurement  at  data  set  k. 
Thus  y  [r  |A;]  is  the  measurement  value  yy  ( k )  in  Y,k  assigned  to  track  r  if  r\k  ={/  }  and  is  9  if 
no  measurement  is  assigned  to  r  at  data  set  k.  Then  we  define  cumulative  data  set  Z  restricted 
by  track  *€T  (Z)  by 


Zlr  =  {(y[T\k)Jt)\(z(k)Jc)eZ}  (64) 

which  is  actually  another  representation  of  a  sequence  (y[f  |l]>y[f  |2]r . )  of  random  elements 

in{*}U(Ur,). 

Consider  a  Markov  process  x  on  X  which  has  initial  state  distribution  q0  and  state  transi¬ 
tion  probability  .  Then,  for  each  r,  we  can  consider  (y  [r  |*])*>o  or  equivalently  Z  |r  as  a 
sequent  of  observation  data  from  an  incomplete  observation  mechanism  which  provides  a  meas¬ 
urement  value  in  Y,k  if  the  observation  is  successful  and  provides  nothing  (represented  by  6)  oth¬ 
erwise,  for  each  k,  according  to  the  statistics  specified  by  pp  and  pM.  Such  an  observation 
mechanism  can  be  modeled  by  g{ ■  |-,fc)  :  (y,tU{0})xX  — ►  [0,oo)  which  is  defined  by 


g(y  M)  = 


{p\t(y  U*)pd(x  !*) 

(1  I*) 


ily^e 

if  y=9 


(65) 


for  each  k.  For  each  cumulative  data  set  Z  and  each  track  r£T(Z),  we  denote  the  conditional 
distribution  of  the  state  of  this  Markov  process  at  time  t ,  conditioned  by  Z  |f  (Z  restricted  by  t) 
by  Pi  (■  | Z  |r)  which  we  call  track  state  distribution.  Then  we  have  a  standard  formulation  of  non¬ 
linear  filtering  problem  with  a  continuous-time  Markov  process  and  discrete-time  observation 
mechanism  defined  by  (65).  Thus  we  have  the  following  lemma: 
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Lemma  2:  For  each  track  r£T  (Z(-k))  on  each  cumulative  data  set  Z(k\  we  have 


Ptk Utx \Z\k))  =  c,( y  \r \k } \Z \krl) )-'  g (y  [-  U 1  \x  ,k )p,t(dx  \Z  ) 


(66) 


where 


c*(y|z^  1))  =  /  g(y  \xM)plk{dx  |zj*-y(t-,)) 


(67) 


for  each  y £YSk  U{^}  and 


plt(dx  | zj^yU)  = 


/  f^Jdx  \x)ptk  fdx  iZf-^.n)  if  A>  1 
x 

/  (<**  l*>?o(<fr)  >f  *=1 


(68) 


Lemma  2  provides  a  tracking  algorithm  for  single-target  no-false  alarm  cases.  It  is,  how¬ 
ever,  nothing  but  a  standard  form  of  the  nonlinear  filtering  equations  based  on  sampled  data 
and  is  obtained  by  straightforward  application  of  Bayes  rule.  Hence,  it  is  not  necessary  for  us  to 
provide  a  proof.  The  algorithm  consists  of  the  updating  formula  (66)  with  (67)  and  of  the  extra¬ 
polation  formula  (68).  Using  the  notion  of  track  state  distribution,  the  main  result  of  this  sec¬ 
tion  can  be  written  in  form  of  the  following  theorem: 


Theorem  2:  (I. LD. -Poisson  Cases)  For  each  k,  let  Z=Z<'k\  Z=Z(*  ^  and  J  W (A  ]\  Then, 
for  each  hypothesis  \£H(Z)  with  the  unique  parent  X  on  Z,  we  have 

p]  (Target  System  State  Estimate)  for  each  n  >#  (X)  and  for  each  w€\V(X/i ), 
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pkk)( fux,  \uj/i ,\z)  =  (n Pli{dxu(r)\zv))  (  n  Plk(dx,  i zg,))  , 

* “ ■  f ^  X  1*1 


i€lm(w) 

pi]  {Number -of -Tar get  Estimate )  for  each  n  >#  (X), 

(n-#  (X)) 

P£>(/i  |X,Z)  =  exp(-v*)  — - 

'  F  (n-#(X))! 

^=E  {Nt  -#  (X)  |X,Z)  which  is  given  recursively  by 


vk  =c{6\Zft)vk_x 


for  each  k>  0  with  c(8  \  Z  1 0)  being  defined  by  (67), 
and 

pii]  (Hypothesis  Evaluation) 


P^(X|Z)  =  Ctf)(Z)-,Pr,)(X|Z)  ^(e(A)|X)  nz*(y[r|4:]|^n7) 


tv/iere  C^k)  (Z)  /s  the  normalizing  constant  defined  by 


Ctf}(Z)  =  Nm(A)!  exp  (uk.x-uk)  P?\z(k) \Z)  , 


LkA(-  IX)  is  the  false  alarm  likelihood  function  defined  by 


LkA((y,)f-\SnJkJk  IX) 

=  (m-«D(XlA:))!pN  (m-n0(Xl<:))  n  Pfa  O'/!*) 

/e/M(m>|*y 


for  each  {iyt  )G  Q  (^T*  W  with 

m-0 


iFAim>  I*)*  {!» . /n}\  (U(r|*)) 


being  the  set  of  indices  for  measurements  which  X  hypothesizes  to  be  false  alarms,  and  L*(-  |^,nj) 
is  //a?  track-to-measurement  likelihood  function  defined  by 


ywwwwv. rjinAAB.  v.'v'. nn  v. .\n rj>. tw,  a  \>:  ~J“ ry  "j>  ■•■>  ">  n_K isxrjr\yn uw-xrjr*.  vr w^jr«  W'  v*  .> w y>  v» yv  uT' 
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Lk(y  l^ry)  — 


»k-iCk(y  1 2 10) 

c*Cv  1^  |,p|/) 


if  TnJ=e. 

otherwise 


for  each  y  £4  (J {0}. 


Proof:  First  we  will  prove  p]  and  pi]  by  mathematical  induction,  p]  and  pi]  are  obviously 
true  for  k=0.2S  Suppose  that  p]  is  true  for  k—  1.  Let  X£H(Z)  be  an  arbitrary  hypothesis  on  Z, 
n  00  and  tj£W  (X/i ).  Let  D£D(n ),  ZD£\V(X/t )  and  a  £A(D  Oat (^))  be  determined  uniquely 
by  (42),  (43)  and  (44).  Then,  it  follows  from  the  additional  assumptions  (59)  -  (62)  and  from  the 
extrapolation  equation  (68)  that,  for  any  m  >0,  any  measurable  set  £  in  X"  ,  we  have 


/  pm  (O',  T-i  I  a  jr\  ,{x, );_,  /t  Jt )  P„fa  (m~nD(\\k))\D  ,(x, )(".,i  /i  Jc ) 

pd(d  \(x, ):.>*) f  p,;*t  l(flk  i&)r.l)p?-t>(ft‘*.  i^az) 

1  lm  1 

X" 

/ (n pM{yt(,)  k))  (  ft  Pfa  (y;  I*))  PnFa{™  -*:>(»■  I*)  I*) 

£  *  _  (77) 

Cfynk  -pD(x,  |*))<“**»)  (Jb^dx-^  |Zp))(  ft  p,k(dx,  1 4)) 

=  PsfA(m  “”d(x  I*)  1*0  (  n  Pfa  (y,  I*)) 

J  tlpA  (m  **  I*) 

f(nPuo  •  <-/<»■■>  l**i(r)’h)PD(Xv(,)  \k)Ptt(dx  ,j(,)  |Z|rr)j))(  n  (1  ~~PD  (Jw<r)  P  ))P«4(^*w{r)  I  Z|tr-.  j)) 

.  1  w(f)€D  w(r  )€0 

<fl  (l  -/>ok  l*)K(<k  14)) 


iClin(w) 


where  nD(\  |£)  is  the  number  of  targets  which  are  hypothesized  (by  X)  to  be  detected  in  data  set 
z(At)  defined  by  (41).  Then  (69)  follows  from  (77),  Lemma  2  and  part  [1]  of  Theorem  7,  i.e,  p]  is 
also  true  for  k,  concluding  the  proof  of  p]. 

It  also  follows  from  (67)  and  (77)  that  the  normalizing  constant  defined  by  (40)  is  given  by 


28  For  k  =0,  the  left  hand  sides  of  (69),  (70)  and  (72)  are  defined  by  (38’),  (37’)  and  (36’),  respective!) . 
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C^HCv’y  )jm-iJn;kjk  lu),X/i  ,Z)  =  (m-nD(\\k)\k)  (  FI  Pm(>'/I*)) 

M  iejFA(m>  \k) 

/  T-~r  _  \  (  _  \  /  __  \f1  (X)  ("^) 

(  n  ^ Cv« <^(0) I 7 ))(  n  c^l2vn/))(c^  l2E))) 

w(r)€D  u(0*D 

where  each  c  (•  |)  is  defined  by  (67).  Obviously,  cc€W(\/z)  is  arbitrary  in  (78).  To  prove  [ii], 
assume  that  [ii]  is  true  for  k  — 1.  Then,  for  any  n  >#  (\)  and  any  MEW (\/i ),  it  follows  from 
(78), 


I^X/t  |X,Z) 


for  each  (y,m)€  U  (V,  X"  *  {m  }  where 


d  =  PAfe, (m-fiD (X  |* )  | * )  (  1^  ]kf>FA  O'/  !*))(  n  c(y, Mr))  I /)) 


w(r)€D 


-  ' 

i. 

(  El  c(<?|4nj))exp(-t/k_i)  . 
w('*D 


Then,  according  to  part  [3]  of  Theorem  1,  (70)  and  (71)  follow  from  (79)  and  (80),  i.e.,  pi]  is  true 
also  for  k,  concluding  the  proof  of  pi]. 

From  (79)  and  (80),  we  can  calculate  the  normalizing  constant  defined  by  (47)  as 

C$(yjmJkfk  | yZ)  =  d  exp{»k){vk_<f*<»*®)  (81) 

where  d  is  defined  by  (80).  Let  Xnfw  and  X^  be  defined  by  given  X,  given  n  and  an  arbitrary  u 
through  (52)  with  (a  ,u)  being  uniquely  determined  by  (42)  and  (44).  Then  we  have 
#(Xnw)=#  (X)~#  (X)  and 

(nc(y[rlA]|^n7))(^_i)#(XH,(  } 

,€  (82) 
=  (  n  ck(y[T\k]\Zynrj)(  n  ^_iC*(y[r|^]i^))  . 

re*0u  f€x,«w 
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It  follows  from  (74)-(76)  and  (80)  -  (82)  that  the  likelihood  L*(z (A:),X  \Z,\)  of  (-(A;),X)  given 
(Z,X),  defined  by  (49)  becomes 


LA. (;(*), X  | Z,X)  = 


exp(t/A  -vk_x) 


4M(c(^)|X)nLA(y[r|A:]|^n7) 

*6  ^ 


(83) 


Then  pii]  follows  from  part  [4]  of  Theorem  1. 


Q.E.D. 


We  should  remember  the  empty  track  0  is  always  included  in  T(Z)  and  p,( ■  | Z^)  is  the 
common  a  posteriori  distribution  at  time  t  of  targets  which  are  not  detected  in  any  data  set 
included  in  Z.  The  reason  why  L*  defined  by  (76)  is  called  track-to-measuremenl  likelihood 
may  become  dear  in  the  following  interpretation  of  hypothesis  evaluation  equation  (72):  The 
posterior  probability  of  any  hypothesis  X  is  the  product  of 

(1)  a  priori  (with  respect  to  the  current  data  set)  probability  of  X,  i.e.,  the  past 
evaluation  P j^_1^(X  1 Z)  of  the  unique  parent  ^  of  X  based  on  the  past 
cumulative  data  set  Z, 

(2)  the  likelihood  L+A(z (k)  |X)  of  a  set  of  measurements  indexed  by 

jFA  | k)  being  the  set  of  false  alarms  in  data  set  z(k), 

(3)  the  likdihood  I*(y[r  |/c]  \Z^^j)=ck(y  lr  1*1  l^nj)  °*  8  measurement 

y[r\k\^6  originating  from  a  previously  detected  target  whose  track  is 
rfW0  in  Z, 

(4)  the  likelihood  L*(0  1 2^.nj)=cfc(0  |2^nj)  of  a  previously  detected  target  whose 

track  is  rfV" j^0  in  Z  being  undetected  in  the  current  data  set  z(k),  and 

(5)  the  likdihood  Lk[y  [t  |/c] \Z^)=vk_-lck{y  [r  | Ac]  jZ^)  of  a  measurement 

y  [r  originating  from  a  newly  detected  target  (rpU  =0), 


divided  by  the  normalizing  constant.  Likewise,  we  call  Lk(0  \Z^)=vk^ck{6  | Z^)  the  likelihood  of 
an  undetected  target  remaining  undetected.29 


Remark  6:  Part  [i]  of  this  theorem  is  valid  even  without  Assumption  A2,  i.e.,  a  Poisson  dis¬ 
tribution  on  number  N-j-  of  targets.  In  other  words,  p]  is  a  consequence  of  all  the  independence 
assumptions:  targets,  ddection,  measurement  errors,  etc.  We  should  note  that  the  target  distri¬ 
butions  are  independent  only  when  conditioned  by  a  target-to-data  hypothesis  (u’,X).  As  dis¬ 
cussed  in  Section  3  ( Example  1),  when  we  mix  (69),  this  independence  will  be  generally  lost.  On 
the  other  hand,  for  parts  pi]  and  pii),  Assumption  A2  is  crucial.  Thanks  to  this  assumption,  the 
conditional  probability  (under  hypothesis  X)  on  the  number  Nr-4t  (X)  of  targets  which  are  not 

29  By  (71),  this  likelihood  is  nothing  but  the  conditional  expectation  erf  the  number  of  targets  which  are 
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detected  in  cumulative  data  set  up  to  k  is  independent  of  X  or  of  any  measurement 

(Cv;  (*))>-*/  \^\t(^))k<k -  I*  is  also  Poisson  and  depends  only  on  the  information  about  which  sen¬ 
sor  was  up  at  each  particular  time,  i.e.,  (ti„Si)k<k-  When  we  exclude  Assumption  A2,  we  must 
calculate  the  a  posteriori  distribution  of  number  NT  of  targets  separately  for  each  hypothesis  X, 
although  such  statistics  depend  only  on  the  number  #  (X)  of  tracks  in  X. 


Even  with  the  additional  assumptions  introduced  in  this  section,  Theorem  2  provides  a  gen¬ 
eral  multitarget  tracking  algorithm  which  includes  many  existing  algorithms  as  a  proper  subset 
in  a  sense  which  we  will  describe  in  the  next  section.  Before  closing  this  section,  we  will  show  a 
batch-processing  algorithm  as  a  corollary  to  Theorem  2. 


Corollary  to  Theorem  2:  For  each  K  and  each  X6H(Z^),  we  have 

i  z<*>) = (fiuFA(\j(k)j)  (n  iK{r^K)j) 


where  is  the  normalizing  constant  defined  by 


=  P2(Z<*>)  exp(u0-uK)  TlNM(k)\  , 

k- 1 


and  lK  (jfZ^K  ))  is  the  track  likelihood  of  t  at  k  defined  by 


Ik(tZ(K))  =  t^o ftc*  (y  (H  *  ]  |  z  irnV(*-'> ) 


with  each  ck(-  |  )  being  defined  by  (65). 


This  corollary  can  be  proved  by  a  straightforward  repetitive  application  of  Theorem  2 
although  one  must  carefully  handle  the  null  track.  In  order  to  obtain  track  likelihood  Ik{t£{-K)) 
for  each  track  r,  we  need  to  calculate  the  normalizing  constant  ck{-  |-)  of  the  filtering  process 
according  to  (67),  which  means  that  we  must  recursively  calculate  the  track  state  distribution 
according  to  the  Filtering  equations  (66)  -  (68).  The  track  likelihood  itself  can  be  calculated 
recursively  as 

not  detected  in  any  data  set  up  to  and  including  k . 


r.V 

J 


BS 


36 


YVV'O'V  V 


TR-1048-01 


May  12,  1988 


lk(r,Z^)  = 


i(k=0 


k*(ylH*]|Z^».-,,)/*_1(rrV<ft-1>fZ{*-,>)  otherwise 


l(T?i»)=.L'k  ifm  J(k)=2 


which  follows  immediately  from  (86).  As  shown  in  the  last  equation,  the  null  track  likelihood 
/*(0,Z<*>)  is  nothing  but  the  expected  number  uk  of  undetected  targets,  which  is  common  for  all 
the  data-to-data  hypotheses.  Using  (85)  and  (86)  we  can  actually  calculate  a  priori  probability 
of  each  hypothesis  X,  i.e.,  Prob.  •{A=X}.  Its  calculation,  however,  involves  the  whole  evaluation 
processes.  This  is  a  reason  why  we  may  not  use  a  simple-minded  Bayesian  expansion  such  as30 

p(x|z)  =  ^pfz7lp(x)  • 


6.  RELATION  TO  EXISTING  ALGORITHMS 

In  an  appropriate  sense,  the  algorithm  provided  by  Theorems  1  and  2  in  the  previous  sec¬ 
tions  includes  representative  multitarget  tracking  algorithms  which  have  been  developed  up  until 
now,  as  a  proper  subset.  This  section  will  show  this  by  describing  the  relation  of  our  algorithm 
developed  in  the  previous  section  to  representative  existing  algorithms.  Since  Reid’s  algorithm 
described  in  [4]  gave  us  the  most  significant  motivation  for  this  report,  we  will  spend  dispropo- 
tional  space  for  it. 


30  Cf.  Remark  5  in  Section  3. 
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6.1.  Reid’s  Algorithm  Besides  all  the  assumptions  we  have  made  by  now,  in  [4],  the  indivi¬ 
dual  target  state  transition  probability  is  given  by  a  linear-gaussian  model, 

.v( (/A )=/rAr.x,  (tk_\)+gaussian  noise,  and  the  measurement  error  is  also  linear-gaussian  as 
}'i  (k)=Hkx,  (t  k)+gaussian  noise ,  where  the  individual  target  state  space  X  is  a  Euclidean  space 
and  matrices  F  and  H  are  with  compatible  dimensions.  Moreover,  each  measurement  value 
space  Ys  for  each  sensor  s  is  implicitly  assumed  / o  be  a  compact  set  in  a  Euclidean  space  with 
search  (or  scan )  volume  vs(Ys).  In  other  words,  each  Ys  is  the  field  of  view  of  sensor  s.  By 
assuming  uniform  probability  detection  on  the  field  of  view  for  each  sensor,  we  have 

Pd(x  I *)=PaBM  J G(y-Hkxfik)dy  (84) 

y.L 


where  ppJ“€(0, 1),  (?(•;=)  is  the  density  of  the  zero-mean  multidimensional  gaussian  distribution 
with  variance  matrix  E  and  Rk  is  the  measurement  noise  variance  matrix.  Accordingly,  the 
state-to-measurexnent  transition  may  be  modeled  as 


Piu(y  M)  = 


G{y-Hkx\Rk ) 
jG(r,-HkxJik)dn 


Assume  that  the  initial  distribution  q0  of  the  individual  targets  is  thinly  spread  over  a  large 
region  and  that  the  measurement  error  variance  Rk  is  very  small  compared  with  the  field  of  view 
for  each  sensor.  Then  we  may  conclude  that  each  nonempty  track  has  a  state  distribution 
which  is  well  approximated  by  a  gaussian  distribution.  Thus,  for  each  data  set  k  and  each  track 
r€T(Z(<0)  such  that  r=rrV(*_I)/0,  we  can  approximate  the  track  state  distribution 
p,t{-  by  a  gaussian  distribution  with  mean  xk  |*_j  and  variance  E*|*_i.  Then,  for 

y£Y,t,  the  track-to-measurement  likelihood  defined  by  (76)  becomes 

Z*(y  G(y-Hkik\k-x  -,HkXk\k_lHl+Rk) 

_i  .  (86)31 

-  Pd**  (2jt)  2  det  (HkEk  ,  k.^+Rk)  2  exp(-i,Y2) 


where 


Ht  is  the  transpose  of  matrix  or  a  vector  H . 
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PFA(y  \k)  =  (n,k(Ytt))- 


When  we  add  a  series  of  approximations  mentioned  above  and  the  additional  assumptions 
((84),  (85),  (88),  (89),  etc.)  to  the  general  algorithm  shown  as  Theorem  2  in  the  previous  section, 
we  obtain  exactly  Reid’s  algorithm  described  in  [4]  if  we  define  false  alarm  density  by 

0FA(k)»uFA(kyntk(Ytk)  (90) 

and  density  of  previously  unknown  target  that  has  been  detected 34  by 

^r(^)  =  4(y[HMlzi0,))»^-ic*(ykU)|zf0-1))  (9i) 

where  y  (r  |k]  is  the  value  of  the  measurement  which  is  hypothesized  by  track  r  to  originate  from 
a  newly  detected  target. 

We  should  note  that  in  [4]  the  left  hand  side  of  (91),  i.e.,  fjNT(k),  is  a  constant  for  any 
given  k  while  the  right  hand  side  is  a  function  of  measurement  value  y  [r \k].  Thus  (91)  shows  a 
crucial  difference  between  our  algorithm  and  Reid’s  in  [4).  In  fact,  ^^(k)  should  be  a  function 
of  k  and  the  measurement  value,  since  otherwise  newly  started  tracks  (hypothetical  newly 
detected  targets)  may  be  given  increasingly  unjustifiably  high  possibilities.  For  example,  sup¬ 
pose  we  are  watching  an  area  for  a  sufficiently  long  period  of  time  with  a  reasonably  high  proba¬ 
bility  of  detection.  Then  the  chance  that  we  see  a  target  which  is  detected  for  the  first  time  in 

the  middle  of  the  field  of  view  is  very  low.  This  is  so  since  if  that  happens  such  a  target  must 
have  evaded  detection  many  times  while  moving  around  or  reaching  the  middle  of  field  of  view, 
which  is  very  unlikely.  To  prevent  this  from  happening,  Reid  proposed  to  adjust  his  0NT  as 
described  in  a  paragraph  in  [4]: 

. .  a  calculation  of  ,  the  density  of  new  (i.e.,  unknown)  targets,  is  performed  whenever  a 

data  set  from  a  type  1  sensor  is  received.  The  density  of  new  targets  £vr  depends  upon  the 
number  of  times  the  area  has  been  observed  by  a  type  1  saisor  and  the  possible  flux  of  undetected 
targets  into  and  out  of  the  area. 

Aside  from  the  above  description,  there  is  no  further  explanation  as  to  what  "calculation"  is  per¬ 
formed  to  obtain  appropriate  0ST.  In  earlier  literature  [14]  by  Reid,  a  rather  heuristic  method 
for  calculating  0NT  is  described,  in  which  the  target  state  space  is  divided  into  many  square  cells 
and  the  inflow/ outflow  of  undetected  targets  from  cell  to  cell  is  considered.  In  effect,  this  kind 
of  procedure  corresponds  to  the  calculation  of  expected  number  vk  of  undetected  targets  and  the 
state  distribution  p,  (•  1 2  |0)  of  null  track. 

M  Reid’s  terminology  in  [4]. 
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The  derivation  of  Reid’s  algorithm  in  [4]  is  based  upon  the  following  Bayesian  expansion: 
First  let 


'Yvr(^)  =  #(A k)~#  (A*_i) 


be  the  number  of  newly  detected  targets  in  data  set  k.  Reid  then  assumed  that  NSJ(k)  has  a 
Poisson  distribution  with  a  given  mean35  ^>7- (A)  and  that  the  measurement  value yA  (l  ^  originat¬ 
ing  from  each  newly  detected  target  i  is  uniformly  distributed  on  the  Geld  of  view,  y  With 
the  uniform  detection  assumption,  the  density  of  newly  detected  target  is  calculated  as 


0KT{k)-l'NT{kynt<tySk)  . 


N&(k) 


-  # 


(A*-on  iDT(k)j 


be  the  number  of  targets  which  are  detected  in  some  data  set  included  in  Z ^  ^  and  also  in  the 
current  data  set  z(k).  Then,  again  with  the  uniform  detection  assumption,  we  have 


pwb.  {.v&(k)=n  (92) 

n  !  (#  (X)-n )! 


for  each  each  integer  n  such  that  0<n  <#  (X).  In  other  words,  the  probability  dis¬ 

tribution  of  N$j(k)  conditioned  by  A*_j  becomes  a  binomial  distribution  with  mean 
#  (A  ic-i'Pd**-  Then,  conditioned  by  Nj^-^k)  and  N£~{k),  the  probability  distribution  of  the 
number  NM(k )  of  measurements  in  the  current  data  set  z(k)  is  determined  by  the  number-of- 
false-alarm  probability  function  Pnfa>  i-e->  (88)-  Assuming  that  NpF(k),  N^ik)  and  NFa  (k)  are 
independent  when  conditioned  by  (A*_ltZ^-1^),  we  have 


Prob.  {N&{k)=nDT  ^T{k)=nNT  flFA  (k)=nFA  \Ak.^\,Z^}  =  «*<^(fc)+^(k)) 


#(X)! 


nNT  ■  Hpr  •  (#  (X)”tl£)r  )• 


W*)narQ  -PS”) 


To  obtain  the  final  form  of  Reid’s  algorithm,  the  following  two  equations  are  used: 


35  Pd**  's  included  in  D^T(k)  since  NNF(k)  is  the  number  of  newly  targets.  Roughly  speaking,  we  have 
UST  {k)=P03Xl'k- 1- 
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Prob.  {A*—  X  {k)=nFA  ,Nf)f  =n qt  =n\T 

_  n FA  •  nST  •  nDT  ■  (#  &)~nDT  )• 

(« fa  +"dt  +nXT  )■  #  (X)! 


(94) 


nnd 


Prob.  {( V;  (*))/"-ie(4vy  r-1  ,\k=\JZSk~»} 

^jnFi4  (m  |/f) 

(  ri  gcv,  -*ut(zj*-y(*-.>) ;  ) 

r€x«*  )-l 


(95) 


where  Jr, t(Zj*Qy\*-i))  and  S,4(Z  j*^y\*-t))  are  the  mean  and  the  variance  of  the  track  state  distri¬ 
bution  p,k(-  | Z (*-*>(*-„)  which  is  assumed  to  be  gaussian. 

Reid  obtained  his  final  form  by  multiplying  the  three  key  equations,  (93)  -  (95).  In  all  of 
these  equations,  target  interchangeability  is  used  implicitly.  With  closer  observation,  we  may 
question  the  exactness  of  these  equations:  For  example,  when  we  integrate  (95)  over  (Y,  y" ,  the 
integral  will  be  strictly  less  than  one  while  it  should  be  exactly  one.  This  is  so  because  Ys  is  a 
compact  and  hence  a  proper  subset  of  a  Euclidean  space.  Moreover,  (93)  and  (94)  are  valid  only 
when  the  probability  of  detecting  targets  which  have  been  detected  before  is  the  same  for  all 
such  targets.  For  example,  if  a  target  leaves  the  field  of  view  of  a  sensor,  such  an  assumption 
certainly  invalid.  The  same  conclusion  may  be  said  when  a  target  is  detected  by  a  sensor  but  is 
outside  of  the  field  of  view  of  another  sensor,  or  when  a  target  is  on  the  edge  of  the  field  of  view 
of  a  sensor.  On  the  other  hand,  according  to  our  formulation  shown  in  the  previous  sections, 
the  statistics  NpF(k)  can  be  calculated  as 

Prob.  {Nj#=n  \\k^=lX{k~'l)} 

=  £{n(/pz>(*  I k)p,t(dx  izfr1*))6^!  -  JpD(x  | k)p,k{dx  izj*-*)))1-^ 

r£X  x  x  (9- ) 

|ex— *{o,i},_£e(r)=n} 


Similarly,  the  statistics  on  number  N^'T(k)  of  newly  detected  targets  on  data  set  k  is  calculated 
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for  any  X£H(Z^~n)wh  ere 


P  =  f  Pd(x  \k)P<Mx  \z$~]))  • 


Namely,  in  our  formulation,  the  Poisson  assumption  on  the  total  number  of  targets  plus 
independence  assumptions  together  imply  that  the  distribution  of  the  number  N^j{k)  of  newly 
delected  targets  at  k  is  Poisson.  While,  in  [4],  (96)  is  used  as  an  assumption.  Traditionally, 
newly  detected  targets  are  treated  by  a  very  ambiguous  notion  of  "Poisson  arrival"  with  which 
any  underlying  target  model  is  not  clearly  stated.  Our  formalism  is  based  upon  a  very  simple 
observation  that,  if  a  target  is  newly  detected,  it  must  exist.  This  includes  the  case  where  a  tar¬ 
get  is  born  in  the  middle  of  the  field  of  view  of  a  sensor.  In  other  words,  our  concept  of 
existence  is  independent  of  time. 

Reid  also  extended  his  linear-gaussian  result  outlined  above  to  non-gaussian  cases  (totally 
discrete  space  and  hybrid  space)  in  [12],  [15]  and  [16].  Besides  the  problem  mentioned  above,  the 
uniform  detection  assumption  may  be  inadequate  in  some  applications.  For  example,  when  a 
surveillance  region  includes  masking  areas  or  sensor's  detection  is  highly  dependent  on  a  target 
state  component  (such  as  MTI-type  radars,  emission-detection  type  sensors,  radars  with  cross- 
section  sensitive  detection,  etc.),  this  assumption  may  be  unacceptable.  In  such  a  case,  the 
detection  probability  function  must  be  included  in  an  integrand  of  the  track-to-measurement 
likelihood  rather  than  a  constant  multiplier,  and  accordingly,  the  track  state  distribution  must 
be  updated  by  assumption  that  the  hypothesized  target  has  evaded  the  detection.  When  we 
ignore  this  consideration,  however,  there  is  no  need  for  re-evaluation  of  either  hypotheses  or 
track  state  distributions  by  a  no-measurement  data  set  (6, Oj  f).  This  is  why  Reid  separately 
treats  type-2  sensors  which  generates  oily  data  sets  each  of  which  contains  at  most  one  measure¬ 
ment.  Apparently,  when  no-measurement  data  sets  are  valuable  information,  such  treatment  is 
inadequate. 


6.2.  PDA  and  JPDA  As  shown  in  the  previous  subsection,  the  most  difficult  part  of  multi- 
targe’  tracking  problems  arises  when  the  number  of  targets  is  not  known  so  that  we  must 
hypothesized  each  measurement  as  a  potential  new  target  which  has  not  been  recognized  before. 
Thus  if  the  number  of  target  is  known,  considerable  simplification  is  possible.  The  PDA  ( Proba¬ 
bilistic  Data  Association  [17] )  and  the  JPDA  {Joint  Probabilistic  Data  Association  [18]  and  [19] ) 
algorithms  are  based  upon  the  assumption  of  known  number  of  targets.  Moreover,  one  of  the 
underlying  assumptions  for  these  algorithms  may  be  characterized  as  targets  with  a  priori  identif¬ 
ication  as  discussed  later.  First,  we  will  show  that  our  algorithm  can  be  reduced  to  PDA  or 
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JPDA  algorithm  with  additional  assumptions: 

To  do  this,  aside  from  all  the  assumptions  made  in  Sections  4  and  5,  we  add  an  extra  sen¬ 
sor  which  we  may  call  "super  sensor"  providing  a  priori  information  on  targets.  Let  i  be 
SU{T}  and  T£S.  Assume  that,  at  k  = 0  and  only  at  k= 0,  sensor  J  provides  a  data  set 
(O',  (O)V'-i^  and  that,  in  this  special  data  set,  the  probability  of  detecting  all  the  targets  is 
ore  and  there  is  no  false  alarm.  The  effect  of  having  this  special  data  set  is  twofold;  (1)  The 
number  of  targets  is  exactly  known,  i.e.,  Nf=n,  and  (2)  targets  are  labeled  by  measurement 
index  (i  ,0).  Suppose,  as  a  result  of  update-by-measurement,  we  have  the  track  stale  distribution 
p, (•  |Z^|,  o)})  for  track  {(/  ,0)}.  For  each  data  set  k>  0,  because  Prob.  {NT  —n  }=1  the  likelihood 
of  any  measurement  originating  from  a  newly  detected  target  is  always  zero.  Therefore,  when 
we  extend  every  cumulative  data  set  to  include  the  super  sensor  data  set  k— 0,  all  the  hypotheses 
in  H(Z)  for  any  cumulative  data  set  Z  with  positive  a  posteriori  probability  have  the  same 
number  of  tracks  which  uniquely  correspond  to  each  other  through  their  unique  predecessors  on 
Z^°\  On  the  other  hand,  H(Z)  is  a  mutually  distinct  and  collectively  exhaustive,  set  of 
hypotheses  on  Z,  or  {{A*=X}|X€H(Z)}  is  a  partition  of  the  underlying  probability  space  which 
is  measurable  with  respect  to  the  <r-algebra  generated  by  cumulative  data  set  Z.  Therefore,  we 
can  combine 36  data-to-data  hypotheses  provided  we  know  which  track  corresponds  to  which 
track  in  each  hypothesis.  For  example,  when  X,  ={r{,T2}  (i  =1,2)  represents  a  two  hypotheses  on 
a  cumulative  data  set  Z^k\  if  we  know  tJ  and  rf  originate  from  the  same  target  for  each  j  ,  then 
we  can  combine  these  two  hypotheses  into  one  hypothesis  X={r1,r2}  so  that  each  "combined” 
track  Tj  has  track  state  distribution 

(>,.(•  |Z  j* >)  =  p„(-  IZty)  P„(X,  |Z<*>)  +p„(-  IZpl)  P„(X2  |z<‘>)  .  (97) 

Having  included  a  "super"  sensor,  we  know  exactly  which  track  goes  to  which  track.  Therefore, 
we  can  actually  combine  all  the  hypotheses  at  each  k  thereby  propagating  only  one  hypothesis 
to  the  next  data  set.  Moreover,  by  manipulating  the  individual  target  state  space,  we  can  have 
different  kinds  of  target  dynamics  assuming  that  such  information  is  also  provided  by  the 
"super”  sensor.  From  another  view  point,  by  having  the  "super"  sensor,  the  targets  are  given  a 
priori  identification.  Therefore,  for  "non-super"  sensors,  the  origin  of  every  measurement  is  con¬ 
sidered  in  reference  to  such  target  identification.  This  kind  of  approach  may  be  appropriate 
when  the  targets  of  interest  are  positively  identifiable  and  their  number  is  small.  We  have  thus 
derived  an  extended  version  of  PDA  (Nj=l)  and  JPDA  (NT>0)  from  our  general  algorithm 
shown  in  Section  6.  We  also  have  shown  that,  in  an  appropriate  sense,  multitarget  tracking 
without  a  priori  identification  (labeling)  is  more  general  than  that  with  a  priori  identification. 

Like  Reid’s  algorithm,  the  PDA  and  JPDA  algorithms  assume  the  linear  gaussian  target 


36  Combining  hypotheses  is  one  of  the  critical  topics  in  implementing  any  multi-hypothesis  multitarget 
tracking  algorithm  and  will  be  treated  more  thoroughly  in  Part  II  of  this  report. 
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dynamics  and  measurement  mechanisms.  PDA  uses  so-called  "improper"  distributionj7  and 
JPDA  uses  the  Poisson  number-of-false-aJarm  assumption  (88).  Also  each  track  state  distribu¬ 
tion  is  assumed  to  be  gaussian.  When  combining  hypotheses,  track  state  distribution  of  each 
"combined"  track  is  therefore  the  sum-of-gaussian  distribution  with  a  posteriori  probabilities  of 
hypotheses  as  weights,  as  in  (97).  As  in  Reid’s  algorithm,  such  a  sum-of-gaussian  distribution  is 
then  approximated  by  a  gaussian  distribution  both  in  PDA  and  JPDA.  Therefore,  the  JPDA 
algorithm  may  be  obtained  also  from  Reid’s  algorithm  by  letting  the  newly  detected  target  den¬ 
sity  be  zero  and  by  assuming  a  "super  sensor"  as  discussed  above,  when  hypothesis  combining  is 
performed  to  its  extreme  degree  (combine-all).  However,  we  must  admit  that  the  above  discus¬ 
sions  are  slightly  artificial  and  lack  in  theoretical  rigor  by  introducing  an  artificial  "super  sen¬ 
sor."  More  rigorous  treatment  of  targets  with  a  priori  identification  (labeling)  should  be  done  by 
an  analysis  which  is  similar  to  that  in  Sections  3  and  4  but  without  a  priori  interchangeability 
assumption,  Assumption  2.  In  such  an  analysis,  we  must  evaluate  target-to-data  hypotheses 
rather  than  data-to-data  hypotheses.  Then  hypothesis  combining  is  performed  on  target-to-data 
hypotheses.  In  effect,  this  combination  is  equivalent  to  the  mixture  of  target  system  state  distri¬ 
bution  via  target-to-track  hypotheses,  which  was  discussed  in  Section  3  in  terms  of  Example  1. 
Therefore,  the  JPDA  algorithm  involves  the  two  major  approximations:  (1)  approximation  of 
sum-of-gaussian  distributions  by  gaussian  distributions  and  (2)  approximation  of  dependent  dis¬ 
tributions  (resulting  from  target-to-data  hypothesis  mixture  —  or  hypothesis  combining)  by 
independent  distributions.  To  the  best  knowledge  of  the  authors,  the  second  kind  of  approxima¬ 
tion  involved  in  the  JPDA  algorithm  has  not  been  noticed  up  to  date. 

Several  minor  differences  of  the  JPDA  algorithm  from  Reid's  algorithm  may  also  be 
observed.  One  of  the  differences  is  the  treatment  of  validation  regions.  In  Reid’s  algorithm,  the 
probability  of  a  measurement  falling  out  of  the  validation  region  of  any  track  is  ignored.  As 
mentioned  in  Section  6.1,  the  data  validation  can  be  appropriately  viewed  as  an  approximation 
in  Reid’d  algorithm.  On  the  other  hand,  in  PDA  and  JPDA  algorithms,  the  probability  of  "real" 
measurement  falling  out  the  validation  region  is  explicitly  considered.  Moreover,  a  delicate  con¬ 
ceptual  difficulty  arises  from  the  fact  that,  in  (17J  -  [19],  hypotheses  are  formed  based  upon  the 
result  of  data  validation.  This  results  in  a  slight  difference  between  the  PDA  and  JPDA  algo¬ 
rithms,  i.e.,  when  we  let  the  number  of  target  be  one,  the  JPDA  algorithm  does  not  coincide 
with  the  PDA  algorithm38.  Nonetheless,  unless  validation  regions  are  inadequately  small39,  the 
treatment  of  data  validation  should  not  affect  performance  of  any  multitarget  tracking  algo¬ 
rithm  and  such  delicate  difference  resulting  from  different  treatments  of  data  validation  should 

37  An  improper  distribution  is  a  probability  distribution  P  on  a  measure  spaoe  ( X  fi  ,p)  having  a  con¬ 
stant  density  p  on  X  with  respect  to  p  when  p(X)=oo.  However,  apparently  such  a  probability  distribu¬ 
tion  never  exists.  More  plausible  modeling  will  be  obtained  when  we  assume  that,  for  each  data  set  k  ,  the 

number-of-false-alarm  probability  function  PsFA('  |£)  is  constant  on  {0,1, . J^r^u(k)}  with  a  priori 

upper  bound  N^iX(k)  When  there  is  no  newly  detected  target  in  the  data  set,  this  upper  bound  does  not 
affect  the  hypothesis  evaluation  because  of  the  cancellation  as  far  as  the  number  of  measurement  (k ) 
does  not  occeed  Nfi^ik  ).  Therefore,  we  can  chose  any  upper  bound  even  after  the  data  set  is  received. 

38  After  adjusting  both  algorithms  due  to  the  difference  of  the  number-of-false-alarm  distribution  assump¬ 
tions. 

39  In  such  a  case,  we  can  easily  imagine  that  any  algorithm  may  not  perform  well. 
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be  negligible.  In  the  JPDA  algorithm,  the  probability  of  detection  is  assumed  to  be  a  function  of 
"targets."  However,  for  each  given  target,  the  detection  probability  is  a  constant  and  indepen¬ 
dent  of  its  state  like  in  Reid’s  algorithm.  As  recognized  widely,  both  PDA  and  JPDA  algorithms 
implicitly  assume  separate  a  "track  initiator"  which  provides  all  the  a  priori  information,  i.e.,  the 
number  of  targets  and  state  estimate  for  each  target.  Then  we  may  call  algorithms  such  as  PDA 
and  JPDA  "track  continuation."  It  is  obvious,  however,  these  two  processes,  track  initiation  and 
track  continuation,  cannot  be  carried  out  independently  in  general  cases.  Let  us  call  a  system  for 
track  continuation  a  track  extender  tentatively.  For  example,  a  newly  detected  target  may  pro¬ 
duce  a  measurement  in  a  validation  region  of  a  track  maintained  by  a  track  extender  and  may 
be  ignored  by  both  because  a  track  extender  considers  it  as  a  false  alarm  and  a  track  initiator 
considers  it  as  a  continued  track.  In  such  a  case,  both  must  negotiate.  In  [4],  Reid  shows  that 
his  algorithm  can  be  used  for  track  initiation  and  continuation  simultaneously.  The  clustering 
procedure  described  in  [4]  and  discussed  in  Part  II  of  this  report  then  provides  a  way  to  perform 
these  two  processes  separately  as  far  as  it  is  possible.  As  shown  in  Part  II  of  this  report,  how¬ 
ever,  the  Poisson  assumption  of  the  number  of  false  alarms  in  each  data  set  is  a  crucial  assump¬ 
tion  to  enable  the  clustering  procedure. 


63.  Morefield’s  Algorithm  Morefield’s  paper  [11]  motivated  us  in  selecting  appropriate 
definitions  of  tracks  and  hypotheses.  As  mentioned  before,  our  definitions  are  different  from  his 
in  that  we  have  separated  the  measurement-value  information  from  the  number-of-measurement 
information.  By  this  separation,  we  have  succeeded  in  treating  tracks  and  hypotheses 
mathematically  rigorously  as  shown  in  Sections  4  and  5.  Although  this  report  was  motivated 
considerably  by  Reid’s  paper  [4],  it  does  not  clearly  defined  tracks  and  hypotheses.  In  [17]  -  [19] 
for  the  PDA  and  JPDA  algorithms,  only  target-to-data  hypotheses  are  in  effect  considered. 
Before  discussing  Morefield’s  algorithm  which  uses  a  0-1  linear  programming  technique,  we  will 
show  a  general  batch-processing  algorithm  which  is  obtained  by  adding  the  Poisson  number-of- 
false- alarm  assumption  (88)  for  each  data  set. 

With  this  additional  assumption,  for  each  k,  false  alarm  likelihood  L£a  defined  by  (73)  in 
Section  5  becomes 

4M(z(*)|X)  =  exp(-t/M(*))  II  iy,(k)\k)  (98) 

for  each  hypothesis  X£H(Z^  where  jFA  (•,•  |k)  is  defined  by  (75)  in  Section  5  and  vFa  {k)  is  the 
expected  number  of  false  alarms  in  data  set  k.  For  each  data  set  K  and  each  track  r€T(Z^  v, 
define  the  modified  track  likelihood  by 


7/  rr(K)\  .  h  ^[HMIZj^.,,) 
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where 


€(y)  = 


if 

if  y  =9 


(100) 


Then  it  is  easily  seen  that  we  can  restate  the  result  of  Corollary  to  Theorem  2  in  Section  5  as 


p&k)&  !z(*})  =  TU(tZ{K)) 


(101) 


with 


=  Pz 0K 1)  expi^o-vx )  exp(  (* ))  (1°2) 

km 1  *“1 


for  each  K  and  each  X€H(Z^1).  In  (84)  and  (101),  the  product  over  the  empty  set  is  defined  as 
one  as  usual.  Take  the  logarithm  of  (101)  and  ignore  the  normalizing  constant.  Then  we  have  a 
function  hK  :H(Z^1)— *1R  defined  by 


hK  (X)  =  D°a hl(K)))  =  S  loaf  >))  x(r  ;  X)  (103) 

r€A  t€T(Z<k>)\{0) 


for  every  XeH(Z(A'}). 

.Thus  the  problem  of  obtaining  the  maximum  a  posteriori  probability  (MAP)  hypothesis  X 
at  K  is  equivalent  to  that  of  maximizing  fyc(X).  In  (103), 

(*(' ;  MW»>{0)  s  <o,i}T«z,">'®) 

for  each  X€H(Z(/f))  or  each  X  can  be  considered  as  a  0-1  vector  with  dimension 
#  (T  (Z(A  1)\  {0}).  Namely,  H(Z^l)  is  isomorphic  to  a  subset 


F  =  {x£{0,l}T^z<,r)^(20  xf|+xr,<l  for  all  (rj,r2)  such  that  r1^r2  and 


of  {O,l}T(Z<Ar,l'^0l-  Define  a  binary  matrix  A€{0,l}fr(z(*^^1l)x;(,r)  by 
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A(r.O  >))  =  X(0’  >  T) 

for  all  r£T  (Z^A  ^)\  {0}  and  all  (J  Then  we  have 

F  =  {x€{O,1}T(z(<,)x(0}|Ax<1} 

where  1£{0,1};C*>  is  the  all-one  binary  vector.  Thus  the  problem  of  maximizing  (101)  or 
equivalently  (103)  over  the  set  of  all  the  hypotheses  in  H(Z^)  can  be  rewritten  as  a  0-1  linear 
programming 

find  x£{0,1}t(z<*))'''^  which 
maximizes  crx 
subject  to  Ax <1 


where  cglR1^2^'^  is  defined  by 


c  r  -  io g?(r,Z(A))) 


for  all  r£T(Z(A)).  To  solve  the  above  linear  programming,  we  need  a  special  algorithm  which  is 
extensively  discussed  in  [11].  Although  we  are  not  certain,  some  of  algorithms  for  solving  classi¬ 
cal  assignment  problems  such  as  Munkres’  algorithm[20]  or  modified  Munkres’[21]  may  be  appli¬ 
cable  or  modified  to  solve  the  above  problem  formulated  as  a  linear  programming. 

The  modified  track  likelihood  can  be  recursively  calculated  as 


ck(d\z\?nJik)) 


dk=0 


else  if  y[r  |A]/0 
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for  each  k  and  each  r£T (Z^).  Thus  (101)  also  provides  algorithms  in  which  track  likelihood  for 
each  track  is  recursively  calculated  and  hypothesis  evaluation  is  performed  by  multiplication  of 
track  likelihoods.  In  [11],  like  Reid’s  (in  [4]),  PDA,  JPDA  and  other  many  algorithms,  target 
dynamics  and  measurement  mechanisms  axe  modeled  as  linear-gaussian.  Therefore,  we  can 
repeat  the  process  shown  in  Section  6.1  to  reduce  the  general  algorithm  derived  in  Section  5  to 
Moreficld’s  algorithm.  However,  two  comments  may  be  noteworthy:  (1)  The  probability  of 
detection  is  totally  ignored  in  the  track  likelihood  in  [11].  (2)  The  target  density  information  is 
also  ignored  in  [11].  When  the  probability  of  detection  is  reasonably  high  and  the  value  of  track 
likelihood  is  dominated  by  gaussian  terms,  the  above  (1)  may  not  be  any  deteriorating  factor. 
The  above  (2)  was  given  some  consideration  in  [11]  as  part  of  a  priori  information  represented 
by  P  (X)  which  we  cannot  define  and/or  calculated  easily  as  mentioned  in  the  last  paragraph  in 
Section  5.  We  should  note  that  our  formulation  in  terms  of  (101)  does  include  all  the  a  priori 
information  regarding  targets  and  sensors. 


6.4.  Goodman’s  General  Model  In  [5]  and  [8],  Goodman  describes  a  model  which  is  a  gen¬ 
eralization  of  conventional  linear-gaussian  models  and  derived  an  algorithm  based  on  it.  The 
final  results  in  both  papers  are  very  complicated  and  are  spread  out  through  over  more  than  ten 
equations.  We  will  try  to  extract  the  essence  of  his  approaches  and  results  as  much  as  possible. 
Since  the  result  in  [8]  is  incorporated  into  [5],  we  will  discuss  only  [5].  Our  assumptions, 
Assumptions  1-9  and  Assumptions  A1  -  A5,  made  in  Sections  2  and  5  are  all  shared  by  the  set 
assumptions  made  in  [5]  except  for  those  listing  below40: 

(1)  The  number  of  targets  is  possibly  infinity. 

(2)  The  detection  of  an  individual  target  is  represented  by  a  Markov  process. 

(3)  Possibly  unresolved  measurements,  i.e.,  merged  measurements  are  considered. 


One  of  the  causes  which  make  [5]  and  [8]  very  complicated  and  difficult  to  follow  is  the 
inclusion  of  the  targets’  birth-death  processes.  This  may  be  handled  more  simply  by  setting  the 
individual  target  state  space  as 

^  =  ^ geolocaiional  *  {dcud  filive  \after  (/ucre,f  component  • 

Targets  are  indexed  by  positive  integers  and  every  positive  integer  corresponds  to  a  target  which 
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may  "emerge.”  Therefore,  (9)  in  Section  2  may  be  modified  as 


If°  =  €^+  I  target  i  once  lives} 


and  N-f°=#  (/f0).  Then,  in  [5 ],  the  probability  of  Npf=oo,  i.e.,  f“=Z+,  may  not  be  zero.  This 
may  cause  some  mathematical  concern  because  IR00  is  not  a  locally  compact  space  with  the  usual 
direct -product  topology.  However,  with  an  appropriate  assumptions  on  the  birth-death  process, 
at  any  finite  time  t ,  the  number  of  targets  which  has  been  born  up  until  t  (including  already 
dead  ones)  is  always  finite.  Also  the  inclusion  of  possibility  of  merged  measurements  makes  the 
discussions  in  (5)  very  complicated.  We  will  therefore  retain  our  no-merged-measurement 
assumption  in  the  following  discussion.  The  above  assumption  (2)  in  [5]  is  rather  questionable. 
In  Assumption  (1-6)  of  in  Section  11.4  of  (5],  it  is  stated  that  the  detection  probability  of  a  live 
target  may  be  function  of  its  state  and  "typically  could  be  a  monotonically  decreasing  function 
of  the  distance  between  the  positional  components  and  the  location  of  the  centroid,  for  example, 
of  sensor  system."  This  statement  may  conflict  with  the  markovian  assumption  of  target  detec¬ 
tion.  The  authors  believe  that  the  conditional  independence  assumptions,  i.e.,  Assumption  4  in 
Section  2  and  Assumption  AS  are  more  plausible  and  realistic.  The  target  dynamics  and  meas¬ 
urement  mechanisms  in  [5]  are  modeled  by  "so-called"  event-driven  linear  systems,  i.e.,  for  target 
i ,  we  have 

dxf(t )  -  F(x?(t  )j  W(t  )dt  +  G(x?{t )/  )dw,(t)  +  u,  (xf(t )/  )di 


B 


'  v- 

*  ’'Jr 


]  • 


y?(tk)  =  H(x?(tk)jk)x,c{tk)  +  v,(tk) 


where  x{(t )  and  xf(t )  are  the  continuous  (geolocational)  and  the  discrete41  parts  of  the  j-th  tar¬ 
get,  respectively.  y,c(/k)  is  the  continuous  part  of  the  measurement  in  data  set  k  originating 
from  target  i  while  y?{tk)  have  a  certain  transition  probability  from  fa^(ffc)faty*)). 

The  uncertainty  of  measurement  origins  and  hypotheses  on  it  is  modeled  as  follows:  For 
each  data  set  k,  let  the  set  of  measurement  values  originating  from  detected  targets  and  false 
alarms  be  denoted  by 


41  {dead  plive  }  component  being  excluded. 
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« >\DT(k  ))i  6 IDT  {k)J DT  (k)) 

where  NFa  (k)  is  the  random  integer  representing  the  number  of  false  alarms  and  I DT  'be 
random  set  of  detected-target  indices.  Then,  for  each  k,  the  uncertainly  of  measurement  origins 
is  modeled  by  a  random  function  ak  which  has  formal  disjoint  union 

({1 . . ^FA  (*  )}*  {FA  })  u  ( IDT  (*  )x  {DT  }) 

as  its  domain  and  lakes  values  in 


where  Nkf(k)=N[)r(k)+Np/{  (k)=#  (Idt  (£))-+-#  (Jfa  (£))  is  the  number  of  measurement  in  data 
set  k.  Every  ctk  is  then  assumed  to  be  one-to-one.  Conditioned  by  its  domain  and  range,  every 
realization  of  o*  is  assumed  to  be  equally  probable.  For  each  data  set  K ,  define  a  random  set 
!<*>by 


IK)=Qj/cr(*)x<Dr})U<°> 


Let  J** ^  be  the  power  set  of  the  cumulative  measurement  index  set  J (-K\  i.e., 


J(Ar)={J  |JC/(*>}  . 


Define  a  random  mapping  Q^K\  from  1^  into  by 


QiK)  0)  = 


<(o*(«U)|l<A<X> 

U{MfA)Jc)\l<i<NFA(k)} 


if  •  €  U Idt(^{F>T} 

k—\ 

if  i=0 


whose  realizations  are  called  labeled  partitions.  Then  we  have 

0(A)O)  ft  0  for  all  i€l(A'M0}  , 
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Q^Ki)  fl  =  0  for  all  (ijj)^I(A:)x  I(Al  )  such  that  i?ri’  and 


ie  i*'1'  > 


Namely,  the  image  of  random  function  is  a  partition42  of  but,  as  a  function, 
attaches  labels  to  each  element  of  the  partition.  Each  label  is  either  a  target  index  (i>0)  or  a 
common  false  alarm  label.  Since  the  false  alarms  are  independent,  this  formulation  is  equivalent 
to  (0*  ,A*)  defined  by  Section  3.  In  other  words,  a  labeled  partition  in  [5]  is  another  representa¬ 
tion  of  a  target-to-data  hypothesis  defined  in  Section  3. 

Then  the  evaluation  of  each  labeled  partition,  or  equivalently  of  each  target-to-data 
hypothesis,  done  in  [5]  generally  follows  the  steps  taken  in  the  derivation  of  Reid’s  algorithm  in 
[4]  as 

P(z(k),QW\Z<k-'\Q(k-»)  =  P(z(^)|Nw()t),(2(k)^(*-V(G(*)^«(^)IC(<r'1)^(k"1))  • 

The  second  factor  on  the  right  hand  side  of  the  above  equation  is  calculated  by  the  measurement 
error  and  false  alarm  value  models  while  the  second  factor  is  subsequently  expanded  by  number 

k 

(&)=#(//**)=#  (y/y(A:))  of  targets  born  up  to  tk  and  number  Nnew{k )  of  newly  born  targets 

fail 


NT(k)Nmw(.k) 


Each  term  of  the  right  hand  side  is  then  expanded  as 

P  (Q{k)J*M (k )Ar ( k )Mnrw (k )  \Q*-'Wk-») 

=  P  (Q(<0 1  Nnt  (k  )  A$  (k )  Am  {k  )  Ar  (k  )A„,„  (k )) 
P  (tyvr  (k )  A$ (k )  Am  (k)  \Nt  (k)JJnrw (k )) 

P  (Nr  (k ) \Nnrw  0 k),Q P  ( Nnew (k )  | Q(“-V) 


Namely,  Like  in  the  derivation  of  Reid’s  algorithm  discussed  in  Section  6.1,  a  crucial  step  to 
derive  the  multitarget  tracking  algorithm  in  [5]  is  a  Bayesian  expansion  by  means  of  the  number 

42  When  the  set  of  false  alarms,  Q^K\ 0),  is  not  empty.  If  it  is  empty,  {(?^(i)  ]i€l^M0}}  is  a  partition 
oiJ{K). 
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^KT  (k )  of  newly  detected  targets,  the  number  N^rik)  of  targets  which  has  been  detected  before 
and  detected  again  at  k  and  the  number  NFA(k)  of  false  alarms.  As  mentioned  in  Section  6.1, 
this  kind  of  expansion  is  useful  only  when  the  detection  probability  is  not  functionally  related  to 
target  states.  Thus  the  comments  made  in  Section  6.1  may  be  relevant  also  to  [5]. 

As  oppose  to  Reid  j4]  and  Morefield  [11}  which  evaluate  data-to-data  hypotheses,  [5]  evalu¬ 
ates  target-to-data  hypotheses  as  in  the  PDA  and  JPDA  algorithms.  As  partitions  of  the  under¬ 
lying  probability  space  based  upon  the  available  information,  the  partition  by  target-to-data 
hypotheses  is  Gner  (sometimes  considerably  Gner)  than  that  by  data-todata  hypotheses.  This 
may  cause  both  theoretical  and  practical  problems.  As  discussed  in  detail  in  Part  II  of  this 
report,  the  number  of  hypotheses  poses  always  serious  problems  to  any  kind  of  multi-hypothesis 
tracking  algorithms.  One  of  the  solutions  is  to  combine  or  "aggregate"  hypotheses.  As  men¬ 
tioned  in  Sections  3  and  6.2,  when  hypotheses  are  combined,  we  may  introduce  cross-correlation 
among  targets,  which  sometimes  may  require  a  large  amount  of  memory.  On  the  other  hand, 
the  infinite  number  of  potential  targets  may  cause  some  theoretical  difGculty.  For  example,  sup¬ 
pose  a  sensor  observes  a  single  measurement  at  one  scan  when  no  other  data  set  has  been  gen¬ 
erated  and  when  we  know  there  is  no  false  alarm  generated  in  this  scan.  Then  we  know  at  least 
one  target  has  been  born  and  detected.  If  we  assume  that  the  number  of  potential  targets  one  of 
which  may  be  the  detected  target  is  infinite,  the  probability  of  each  target-to-data  hypothesis  is 
zero  while  we  have  one  data-to-data  hypothesis  having  probability  one.  Moreover,  the  assump¬ 
tion  that  each  newly  born  target  appears  uniformly  in  a  given  space  may  be  questionable. 


7.  CONCLUSIONS 

A  general  multitarget  tracking  problem  without  a  priori  target  identiGcation  (labeling)  was 
formulated  and  a  Bayesian  solution  to  the  problem  was  given.  This  formulation  may  be  the 
most  general  within  the  assumption  of  no  prior  identiGcation  of  targets.  We  did  exclude,  how¬ 
ever,  the  possibilities  of  merged  or  split  measurements.  Inclusion  of  either  one  of  these  possibili¬ 
ties  may  cause  considerable  complication.  We  have  shown  that,  when  targets  do  not  have  a 
priori  identiGcation,  we  need  to  evaluate  only  data-to-data  hypotheses  and  evaluation  of  target- 
to-track  hypotheses  is  unnecessary.  However,  the  consideration  of  target-to-track  hypotheses 
was  a  key  step  in  deriving  our  general  algorithm.  For  the  cases  where  several  additional 
assumptions  are  made,  called  i.i.d.  Poisson  cases,  the  general  result  was  reduced  to  a  much 
simpler  form.  From  this  form,  we  can  derive  many  specialized  forms  depending  on  the  real 
problems. 
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We  have  also  shown  that,  even  when  the  algorithm  is  restricted  to  the  i.i.d.  Poisson  cases, 
it  includes  most  of  the  representative  existing  multitargel  tracking  algorithms  as  a  proper  subset 
in  an  appropriate  sense,  thereby  providing  a  unified  view  on  multitarget  tracking  problems. 
Besides  generality,  our  theoretical  and  practical  contributions  include  the  following  two  aspects: 
First  we  considered  the  dependence  of  target  detection  on  target  states  explicitly  and  incor¬ 
porated  it  into  our  final  form  of  the  algorithm.  This  may  be  very  important  since,  to  the  best 
knowledge  of  the  authors,  this  dependence  has  been  long  neglected  in  the  multitarget  tracking 
literature  while,  in  a  practical  sense,  it  is  not  negligible  in  the  case  of  sensors  with  target  state 
dependent  detection  characteristics.  This  will  cover  many  practical  issues  such  as  radial  velocity 
dependent  detection  by  MTI-type  radars,  terrain  masking,  range  and/or  cross-section  dependent 
detection  by  radars,  radio-emission  and/or  target -activity  dependent  sensors,  etc.  Secondly  we 
defined  the  likelihood  of  a  measurement  originating  from  a  newly  detected  target  in  a  very  clear 
term.  This  provides  the  density  of  newly  detected  targets  in  Reid’s  algorithm  with  a  tran¬ 
sparent  meaning.  Moreover  this  eliminates  a  very  ambiguous  but  often  used  notion  of  Poisson 
arrival  of  newly  detected  targets.  Our  basic  viewpoint  is  that,  whenever  any  target  is  newly 
detected  by  a  sensor,  it  has  been  evading  detection  up  to  that  point  either  because  of  failed 
detection  or  because  of  zero  detection  probability,  i.e.,  because  it  has  been  out  of  field  of  view  of 
any  operating  sensor,  not  active  generating  no  detectable  signal,  not  moving,  thereby  creating  no 
radial  velocity,  etc. 

As  seen  in  Sections  4  and  5,  the  general  multitarget  tracking  algorithm  reduces  the  problem 
into  a  number  of  filtering  problems  that  are  subproblems,  in  either  general  or  i.i.d.-Poisson  cases. 
This  means  that  all  the  filtering  techniques  developed  up  until  now43  are  valuable  sources  for 
multitarget  tracking.  On  the  other  hand,  we  may  say  that  hypothesis  management  is  one  aspect 
truly  unique  to  multitarget  tracking.  By  hypothesis  management44,  we  mean  many  techniques 
to  reduce  the  number  of  hypotheses  while  maintaining  reasonable  performance  of  the  multitarget 
tracking  algorithms.  A  variety  of  techniques  have  been  developed  in  the  past  two  decades. 
They  provide  us  with  very  valuable  information  whenever  implement  at  ional  issues  of  any  algo¬ 
rithm  are  concerned.  For  example,  the  "combining"  of  all  the  target-to-data  hypotheses  in  PDA 
and  JPDA  algorithms  can  be  viewed  as  a  particular  hypothesis  management  technique.  At 
A. I.  &  D.S.  ,  we  have  developed  a  general-purpose  system  for  multitarget  tracking,  called  GTC 
(generalized  tracker  and  classifier)45,  which  has  been  implemented  in  SAIL-,  C-  and  LISP-  pro¬ 
gramming  languages.  Implementational  issues  will  be  discussed  in  Part  II  of  this  report  together 
with  several  examples  created  by  GTC. 


43  See  [22]  for  one  of  the  most  recent  survey  papers  on  this  topic. 

44  Terminology  borrowed  from  AI  (artificial  intelligence). 

45  The  "tracker"  part  represents  state  estimation  of  geolocational  entities  while  the  "dassiGer"  part  any  es¬ 
timation  of  discrete  entities.  See  [23]  for  a  concise  description  of  GTC. 
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